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CHAPTER  I 


INTRODUCTION 

1 . 1  Introduction 

Industrial  robots  which  are  defined  as  computer  controlled 
mechanical  manipulators  have  become  increasingly  important  in 
industrial  automation  in  recent  years.  They  can  be  programmed  to 
perform  the  tasks,  without  human  intervention,  of  arc  welding, 
paint  spraying,  assembly,  foundary  operation,  etc. 

A  manipulator  can  be  described  as  a  series  of  links 
connected  at  joints.  Typically,  they  have  three  to  six  joints 
(three  to  six  degrees  of  freedom)  with  a  gripper  or  end  effector. 
Each  joint  is  driven  by  an  actuator  which  is  connected  to  sensing 
devices. 

The  control  system  design  for  robot  manipulators  is 
basically  the  problem  of  controlling  a  multi-input  nonlinear 
system.  The  general  objective  is  to  achieve  a  very  accurate,  fast 
and  smooth  tracking  while  rejecting  a  broad  class  of  disturbances , 
including  parameter  variations. 

The  simplest  form  of  control  used  for  manipulators  is  the 
open  loop  control  Ul]-f3]),  where  the  entire  input  sequence  is 
predetermined  and  is  applied  regardless  of  any  errors  which 
develop.  The  trajectory  is  preplanned  or  prerecorded  and  the 


inputs  do  not  depend  on  output  measurements.  Disturbance  rejection 
and  accurate  path  tracking  can  only  be  achieved  by  making  the  robot 
extremely  rigid.  This  approach  which  is  coraonly  used  today  implies 
precise  gear  trains  and  actuators  as  well  as  very  strong  structural 
members.  The  speed  of  such  systems  is  generally  limited  by  the 
force-producing  capacity  and  speed  of  the  actuators,  and  by  the 
excitation  of  high-frequency  structural  modes  of  the  linkage. 

Linear  feedback  controls  have  also  been  designed  for 
manipulators.  The  most  widely  used  method  is  the  independent  joint 
control  ((4), [5)),  where  basically  each  joint  is  independently 
controlled  by  a  linear  PD  controller.  Gravity  compensation  is  also 
provided.  The  'computed  torque'  technique  is  an  illustration  of 
the  independent  joint  control.  We  will  present  it  in  detail  in 
chapter  IV. 

A  pseudo-linear  feedback  law  with  nonlinear  pre-  and  post¬ 
processing  of  measurements  was  developed  by  Raibert  and  Craig  [6] . 
In  practice,  this  kind  of  linear  control  is  easy  to  implement, 
works  relatively  well  and  is  reliable.  However,  adequate 
disturbance  accommodation  requires  high-power  actuators  and  this 
technique  becomes  less  effective  when  high  speed  and  accurate 
tracking  are  required. 

In  order  to  improve  performances  (speed,  accuracy, 
transient  response,  etc.),  more  sophisticated  methods  have  been 
developed  in  recent  years. 

Adaptive  control  schemes  based  on  the  model  reference 
principle  have  been  proposed  ([7)-[9]).  Lee  and  Chung  [10] 


«.|  ta(  ((I  <it  4 


presented  an  adaptive  control  based  on  the  first  order 
linearization  of  the  nonlinear  dynamic  equations.  In  general, 
these  adaptive  strategies  are  characterized  by  complex  algorithms 
which  may  lead  to  complexity  comparable  to  that  required  for  real 
time  computation  of  inverse  kinematics. 

Nonlinear  feedback  laws  have  also  been  proposed  by  many 
authors  ((111, [12]).  Young  (13)  used  sliding  mode  theory  to 
develop  nonlinear  switching  feedbacks. 

Since  the  mid-1970's,  the  technique  of  linearizing  a 
nonlinear  system  via  nonlinear  feedbacks  has  been  used  and 
developed  into  what  is  known  today  as  'differential  geometric 

I 

control  theory'  ((14) -(25)).  The  earlier  work  (1976)  was  conducted 
by  Hemami  and  Camano  [21]  who  applied  this  technique  to  a  simple 
locomotion  system  which  resulted  in  uncoupled  subsystems.  In  1982, 
Freund  (22)  through  a  'global  nonlinear  feedback  law'  obtained 
uncoupled  second  order  systems.  This  method  is  based  on 
partitioning  the  robot  dynamic  equations.  But  one  drawback  of  this 
method  is  that  the  maober  of  inputs  should  equal  the  number  of 
outputs . 

Recently,  a  theorem  giving  necessary  and  sufficient 
conditions  to  linearize  a  nonlinear  system  via  a  coordinate 
transformation  and  nonlinear  feedbacks  has  been  given.  This  result 
is  due  in  part  to  Brockett  (23)  ,  Jakubczyk  and  Respondek  [24]  and 
finalized  by  Hunt,  Su  and  Meyer  ([18], (25)). 

In  this  thesis  our  main  work  will  be,  using  this  feedback 
linearization  theorem,  to  show  that  a  given  n-joint  robot 
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manipulator  can  be  linearized  through  a  coordinate  transform  and 
nonlinear  feedbacks.  The  resulting  linear  system  is  composed  of  n 
uncoupled  second  order  time  invariant  subsystems.  The  control 
problem  is  then  reduced  to  controlling  these  n  uncoupled  second 
order  subsystems.  A  general  expression  of  the  explicit  feedback 
controls  will  be  derived.  Two  examples  will  be  given,  a  two  and 
three  joint  robot  manipulator,  with  accompanying  computer 
simulation  study. 


1 .2  Organization  of  the  Thesis 


In  chapter  II,  we  discuss  the  robotics  problem  in  terms  of 
trajectory  planning  and  control  approach  to  be  followed  for  a  given 
robot  taak. 

Chapter  III  deals  with  the  manipulator  dynamics.  To  design 
a  robot  control  system,  one  needs  a  mathematical  model  of  the 
robot.  Such  a  model  can  be  obtained  by  Newton-Euler  or  Lagrangian 
mechanics. 

Chapter  IV  is  the  main  part  of  the  thesis.  We  present 
first  a  robot  controller  using  the  'computed  torque'  technique. 
Then,  after  giving  some  theoretical  background  needed  for  further 
developments,  we  state  the  feedback  linearization  theorem  [25]. 
Using  this  theorem,  we  linearize  the  manipulator  dynamic  equations 
and  derive  a  general  form  of  the  nonlinear  feedback  law.  Then,  we 
illustrate  by  two  examples  with  computer  simulations. 

Finally,  the  summary  and  conclusions  are  given  in  chapter 
V. 
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CHAPTER  II 


TRAJECTORY  PLANNING  AND  CONTROL 

The  basic  problem  in  robotics  is  planning  trajectories  to 
solve  for  some  specified  task  and  then  controlling  the  robot  to 
achieve  those  trajectories. 

The  trajectory  planning  consists  of  computing  a  desired 
sequence  of  positions,  velocities  and  accelerations  of  some  point 
which  is  usually  the  robot  hand.  This  is  in  fact  the  so  called 
kinematics  problem  which  will  be  discussed  in  the  next  section. 

The  control  strategy  to  be  adopted  depends  on  the  nature  of 
the  specified  task  itself.  For  example,  if  the  robot  is  permitted 
to  travel  between  the  initial  and  final  positions,  a  simple  point 
to  point  control  is  adequate.  In  this  chapter  we  will  give  a  brief 
survey  of  these  control  strategies. 

2 .1  Kinematics 

Kinematics  of  a  manipulator  ([26],  [27])  can  be  defined  as 
being  the  position,  velocity  and  acceleration  relationships  among 
the  links  of  the  manipulator. 

In  planning  a  trajectory,  one  is  primarily  interested  in 
the  position  of  the  hand  with  respect  to  the  work  space,  which  is 

called  the  hand  space  of  the  manipulator.  A  hand  configuration  in 

#  ♦ 
hand  space  consists  of  position  described  by  a  vector  P  and 


orientation  described  by  three  orthogonal  vectors:  the  approach 
vector  a,  the  orientation  vector  o  and  the  normal  vector  n. 

A  robot  task  is  naturally  specified  in  terms  of  its  hand 
configuration  in  hand  space.  It  is  a  transformation  (matrix: 

(n(t),  o(t),  a(t),  p(t)),  called  the  forward  kinematics  transform 
which  relates  the  hand  frame  to  the  robot  base  frame. 

To  achieve  the  desired  configuration,  one  has  to  command 
the  joint  actuators.  To  do  so,  we  must  be  able  to  find  the 
corresponding  joint  coordinates  (in  joint  space)  from  the  desired 
hand  configuration.  This  inverse  problem  is  referred  to  as  the 
inverse  kinematics  transform,  or  arm  solution. 

The  direct  kinematics  has  a  straightforward  solution, 
whereas  problems  can  occur  when  computing  the  inverse  kinematics. 
The  solution  may  not  be  unique  and  singularities  may  occur, 
depending  on  the  geometrical  configuraion  of  the  arm. 

2 .2  Control 

As  mentioned  previously,  the  control  strategy  to  be 
considered  depends  on  the  assigned  robot  task.  These  strategies 
are  classified  as  follows: 

2.2.1  Point  to  point  control 

If  there  are  no  path  constraints,  if  the  work  space  is  free 
and  if  coordination  with  external  moving  objects  (e.g.  conveyors) 
is  not  required,  positional  control  can  be  used  to  ensure  that  the 
hand  passes  through  the  specified  corner  points  of  the  path.  No 
control  over  position  is  required  between  points.  The  path  of  the 


hand  in  such  control  schemes  is  unpredictable  and  the  robot 
exhibits  a  tendency  to  stop  at  each  point. 


In  many  cases  we  require  that  the  hand  moves  smoothly  along 
a  prescribed  path  (path  tracking).  This  involves  many  computations 
of  corresponding  desired  joint  coordinates  (through  the  inverse 
kinematic  transform).  Two  cases  occur  here:  off-line  path 
control,  in  which  computations  are  performed  before  the  motion 
starts  and  on-line  path  control,  in  which  calculations  are 
performed  in  real  time. 

2.2.2  Off-line  path  control 

If  the  work  space  is  free  and  no  external  coordination  is 
needed,  the  hand  path  and  the  corresponding  desired  joint 
coordinates  can  be  specified  before  the  motion  is  to  start.  To 
accomplish  a  smooth  motion  (smooth  accelerations),  some  techniques 
are  available,  for  example  the  path  control  polynomials  technique 
[26]  . 

2.2.3  On-line  path  control 

When  external  coordination  is  needed,  then  path  points  and 
desired  joint  coordinates  have  to  be  computed  on-line.  This 
constraint  is  computationally  very  difficult,  and  such  a  strategy 
is  in  practice  used  when  accurate  path  tracking  is  important  and 
the  manipulator  moves  slowly.  There  are  control  techniques  which 
compromise  between  full  on-line  path  control  and  point  to  point 
control,  in  order  to  achieve  smooth  and  accurate  tracking  (e.g., 
joint  interpolated  control  ( [26]  ,  [28] ) ) . 
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2.2.4  Collision  free  path  control 

If  the  work  space  is  changing  and  not  free,  then  collision 
free  paths  must  be  followed.  This  makes  path  control  very  complex 
if  on-line  path  control  is  required. 

2.2.5  Force  control 

In  some  situations,  the  manipulator  is  constrained  by 
external  positional  constraints.  Two  common  situations  can  happen: 
guarded  motion,  when  the  manipulator  is  about  to  contact  a  surface; 
and  compliant  motion,  when  the  manipulator  is  in  continuing  contact 
with  a  surface.  In  chose  cases,  one  has  to  control  the  forces 
instead  of  the  positions  and  we  will  be  relating  forces  in  hand 
space  to  torques  (and/or  forces)  in  joint  space. 

Some  controllers  can  simultaneously  control  forces  along 
certain  coordinate  axes  and  control  positions  along  the  remaining 
axes.  They  are  referred  to  as  hybrid  controllers. 


CHAPTER  III 


DYNAMICS  OF  A  ROBOT  MANIPULATOR 

A  robot  manipulator  is  a  mechanical  structure  which 
consists  of  a  series  of  links  connected  at  joints.  When  several 
joints  move  simultaneously,  the  motion  and  the  torque  applied  at 
one  joint  have  a  dynamic  effect  on  the  motion  at  other  joints. 

This  results  in  high  coupling  among  the  joints  and  makes  the 
overall  system  dynamics  very  complex. 

For  purposes  of  dynamic  control,  one  needs  a  mathematical 
model  of  the  manipulator.  Such  a  model  can  be  obtained  by  deriving 
the  robot  equations  of  motion  using  either  the  Lagrangian  or  the 
Newton-Euler  approach  [29]  .  The  resulting  dynamic  equations  for  a 
n-joint  manipulator  are  highly  nonlinear  and  coupled.  They  have 
the  following  form: 

D(q)q  ♦  H(q , q )  +  G(q)  -  T  (3.1) 

where  q  is  an  (n*l)  vector  of  the  actual  joint  positions  (n 
joints). 

D(q)  is  an  (n*n)  inertial  forces  matrix. 

H(q,q)  is  an  (n*l)  corriolis  and  centrifugal  force  vector. 
G(q)  is  an  (n*l)  gravitational  force  vector. 

T  is  a  (n*l)  generalized  input  vector  (torque  and/or  force). 
For  a  six-joint  manipulator,  computing  these  equations  is  a  very 


difficult  task.  The  result  is  hundreds  of  algebraic  terms  which 
makes  the  on-line  computation  of  the  control  torques  a  major 
problem  in  robot  control. 

Next,  we  derive  the  dynamic  equations  of  a  two  joint  and  a 
three  joint  manipulator  that  will  be  used  later  for  purposes  of 
control . 


3.1  Dynamic  Equations  of  a  2-Joint  Manipulator 


We  first  consider  the  two-rotat ional-joint  robot 
manipulator  of  Figure  3.1,  where  the  first  link  has  a  length  4.^ 
and  mass  ra^  ,  the  second  link  has  a  length  l ^  and  mass  m^  .  Both 
masses  are  considered  to  be  centered  at  the  link  mid-points.  The 
load  of  mass  m^  is  placed  at  the  end  of  the  second  link. 

Using  polar  coordinates  the  potential  energy  can  be  expressed  as: 

*1  *2 
V  *  nijg  —  sin  ^  +  ^gUjSin  ®i  +  ~  s in( 9j +02 ) ) 


♦  m^g(£j  sin  9^  +  i^s in( 9^ +  02 ) ) 


(3.2) 


The  kinetic  energy,  which  is  the  sum  of  the  kinetic  energies  of 
link  1  (m^ ) ,  link  2  (ra^)  and  the  load  M^,  is: 
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where  9^,  0^  are  the  joint  angles  and  9^  ,  9^  are  the  cor¬ 
responding  angular  velocities. 


The  Lagrangian  L  3  K  -  V  is  Chen, 
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The  dynamic  equations  are  obtained  from  the  Lagrangian  equations 
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where  T^  is  the  torque  (or  force)  applied  at  each  joint.  The 
final  dynamic  equations  are: 
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3.2  Dynamic  Equations  of  a  3-Joint  Manipulator 


Consider  now  the  three  lomt  manipulator  of  Figure  3.2.  It 
consists  of  one  rotational  joint  which  rotates  in  the  (x-y)  plane 
(joint  variable  $).  It  also  has  two  prismatic  joints.  One  allows 
the  hand  to  extend  in  the  (x-y)  plane  (joint  variable  r)  while  the 
other  lets  the  hand  translate  along  the  z-axis  (joint  variable  z) . 
The  arm  has  a  length  l  and  a  mass  ra^ .  The  load  with  a  mass 
is  placed  at  the  end  of  the  arm  as  shown  in  Figure  3.2.  We  suppose 
that  r  >  1/2. 

In  the  cylindrical  coordinates  the  potential  energy  is: 


£ 
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(3.8) 


The  kinetic  energy  is: 
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The  lagrangian  L  is  then, 
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If  we  denote  by  F  the  force  applied  by  the  actuator 

K 

along  the  r  direction,  by  F  the  force  applied  by  the  second 


W!»i 


prismatic  joint  along  the  z-axis,  and  by  T  the  torque  applied  by 
the  rotational  joint;  we  obtain  the  following  dynamic  eouations: 

Fr  *  ("l  *  i  '  K  *  l  mR}r’2  +  i  V^2 

♦  (m.  ♦  m_)g  sin  $  .  (3.11) 


3  *  3  2  *’  3  * 

T  -  2^  ♦  27  ♦  (a^  ♦  j-  mR)r  *  -  - 

♦  r  m_£(  4-3 r) ’♦  ♦  (m,  *  mD)gr  cos  j  ^  m  g  cos  $  (3.12) 


CHAPTER  IV 


ROBOT  CONTROLLER  DESIGN 

In  this  chapter  we  present  two  different  control  concepts. 
In  section  4.1  we  present  an  independent  joint  control  method,  the 
'computed  torque'  technique. 

In  section  4.2  we  propose  a  nonlinear  control  approach 
based  on  feedback  linearization.  A  control  algorithm  with  explicit 
nonlinear  feedback  is  derived.  We  also  illustrate  by  two  design 
examples  with  computer  simulations  to  evaluate  the  performance  of 
the  proposed  control  method. 

4 . 1  Controller  Design  with  the  'Computed  Torque'  Technique 

Most  of  the  control  approaches  found  in  today's  commercial 
robots  use  the  method  of  independent  joint  control  ([4), [5]).  An 
illustration  of  these  methods  is  the  'computed  torque'  technique 
([30], [31)),  also  called  the  'inverse  problem'  technique 
([32], [33]).  It  is  basically  a  linear  proportional  and  derivative 
control  law. 

As  seen  in  chapter  III,  the  actual  equations  of  motion  of  a 
robot  are  in  the  form: 


D  ( q )  q  +•  H(q,q)  +  G(q) 


■  T 


(4.1) 


,t 


The  principle  of  Che  'computed  torque  technique  is  as 
follows.  Let  Dc(a),  H^iq.q)  and  G^Cq)  be  the  computed  counter¬ 
parts  of  the  actual  D(q),  H(q,q)  and  G(q).  Let  the  control  t 
(force  or  torque)  be: 

T  »  Dc(q)[qd  +  Kp(qd-q>  *  K^-q)]  +  Hjq.q)  +  Gc(q)  (4.2) 

where  qd>  qd  and  q"d  are  respectively  the  desired  joint 

position,  velocity  and  acceleration  vectors. 

K  and  K  are  constant  scalar  feedback  gains  (PD  action). 

P  v 

If  we  assume  that  the  computed  Dc ,  Hc  and  Gc  are  equal  to  their 
actual  counterparts,  i.e.: 


Dc(q)  M  D(q) 


Hc(q,q)  -  H(q,q) 


Gc(q)  •  G(q) 


(4.3) 


then  from  (4.1)  and  (4.2)  we  obtain: 


*  Vvq)  +  Kv(vq) " 0  • 

If  we  let  e  ■  q  -q  be  the  joint  position  error,  then  (4.4) 
q  d 

becomes: 


e  ♦  K  e  +  K  e 
q  v  q  p  q 


(4  .4) 


(4.5) 


The  control  problem  is  then  reduced  to  assigning  poles  with 
negative  real  parts  for  (4.5)  such  that  the  error  e^  approaches 
zero  asymptotically.  One  should  note  that  the  convergence  relies 
on  the  validity  of  (4.3). 


A 


From  (4.5),  the  component  e  of  the  error  vector  e 

q .  q 


has  the  following  character ist ic  equation: 


2  4  2 

s  ♦  K  s  ♦  K  ■*  +  2Cus 

v  p  n 


2 

ui  ■  0 
n 


(4.6) 


where 


«>  *  rK  is  the  natural  frequency, 

n  p  7 

K 

C  ■  v  is  the  damping  ratio. 

2/r 

p 


In  almost  any  robot  application  overshoot  is  to  be  avoided.  In 
th  case,  the  fastest  response  with  no  overshoot  is  the  critically 
damped  one  corresponding  to  £  equals  to  one.  Hence, 

k  -  2/r 

v  p 

and  the  error  time  response  is: 


-/Tt  -/r  t 

e  (t )  *  c  e  p  ♦  c  t  e  p 
q.  1  2 


(4.7) 


where  c^,  c^  are  constants. 

From  (4.7)  one  can  say  that  the  larger  the  values  of  the 


feedback  gains  and  ,  the  faster  the  asymptotic  convergence 

of  e  (t)  . 

<i 

As  an  example  we  will  use  this  technique  to  control  the 


three  joint  manipulator  of  figure  3.2. 


Example  4.1,1.  Three  tome  manipulator: 

From  Che  dynamic  equations  (3.11),  (3.13)  and  the  control 
law  as  given  by  (4.2),  the  actual  control  torque  T  and  control 
forces  FR,  are: 


fr  *  K.  *  4^rd  *  Vrd‘r)  *  V  vf)  J +  r 


3  •  2 

+  j  mRt*  +  (mL  ♦  mR)g  sin  $. 


((mL  ♦  y»R)r2  *  ±mRl(l-3r))(*d  ♦  Kp(^-4)  *  K/^-i)) 
3  *  3 

+  2(mL  +  -  mR)rr*  -  ^  mRtr#  *  ^tnL*wR^r  co*  ♦ 


*  «,  y  g  co«  ♦ 


(4.9) 


VZ  "  ^i/V^d  *  WZ>  * 


(4.10) 


where  q  *  (r.^.z)1  and  q  are  the  actual  joint  position  and 

velocity  vectors,  q,  ■  (r,  ,z,)C  and  q,  are  the  desired  joint 

d  a  d  a  a 

position  and  velocity  vectors,  r. ,z.  are  the  desired  accelera- 

d  d  a 

cions.  K_ ,  K  are  constant  feedback  gains, 
p  v 

A  computer  simulation  waa  then  conducted.  In  the  first 
part  of  the  simulation,  we  simulated  Che  motion  of  the  end  effector 


from  one  initial  path  point  to  the  next  desired  path  point,  or 
equivalently  through  the  inverse  kinematics,  from  an  initial  joint 
state  to  a  desired  one. 


Furwmgf?r*i  wr  ravnwimmii*  ih»**w  vm  miwmww 
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Initial  joint  state  (r  ■  1  m,  $  ■  0  rd,  z  =  -1  m),  and 
desired  one  (r  ■  0.5  m,  $  »  -0.6  rd,  z  ■  -0.2  m)  were  arbitrarily 
chosen.  For  the  entire  simulation  the  feedback  gains  and  Kv 
were  chosen  to  maintain  a  critically  damped  response,  i.e. 


K  -  2/T. 

v  p 


Figure  4.1,  4.2  and  table  4.1  show  the  simulation  results 


for  11^  *  5000.  A  steady  state  error  is  present,  due  to  the  fact 


that  (4.5)  represents  a  type  0  system.  High  torques  are  also 
required  for  the  motion. 

Table  4.2  gives  the  steady  state  errors  and  the  maximum 


value  of  force  F  (F  showed  higher  values  than  T  and  F  )  as  a 

C.  Lt  K 


function  of  the  feedback  gain  . 

One  notes  that  as  Rp  increases,  the  steady  state  errors 
and  the  convergence  time  decrease.  However,  the  torque  values  at 
the  beginning  of  the  motion  increase,  which  means  more  powerful 


actuators  are  needed.  Indeed,  the  position  error  e  and  its 

9 


derivative  e  are  maximal  at  the  beginning  of  the  motion.  The 
9 


inertial  term  is  the  dominant  part  in  the  expressions  of  the 
torques  given  by  (4.2).  This  implies  that  high  values  of  and 


R^  may  result  in  high  torques  until  e^  and  e^  are  small  enough 


to  cancel  out  the  effect  of  such  R_  and  R  .  Hence,  one  should 

P  v  ’ 


compromise  between  accuracy  speed  and  maximise  allowable  torques. 

One  way  to  maintain  good  accuracy,  fast  response  and 
smaller  torques  is  with  time  varying  feedback  gains.  One  can  first 
put  a  bound  on  the  maximum  permissible  torques  and  starts  with 


•mall  values  of  R_  and  R  .  As  the  errors  e  and  e  are 
Tv  q  q 


.V- 


*  *  •»  *  « 
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Table  4.1.  Simulation  of  ioint  motions  from  f  1  , 0  ,  —  l )  to 
(0. 5,-0. 6,-0. 2)  with  Kp  -  5000. 


THREE  JOINT  ROBOT  MAN  I  PULA  TOR  -  COMPUTED  TORQUE  TECHNIQUE 
CONSTANT  FEEDBACK  GAINS 


0 

1 . 00000 

0 

- ’  00000 

10000 

.51751 

-  58181 

-  19139 

20000 

.51419 

-  38581 

-  1 8582 

30000 

. 51418 

-  53582 

-  18582 

40000 

.51418 

-  58582 

-  13582 

50000 

.51418 

-  58582 

-  1 8582 

60000 

. 51418 

-  58582 

-  i 8582 

Table  4.2.  Steady  state  position  error,  maximum  force  F^  and 
convergence  time,  as  a  function  of  the  gain  K  ■ 


Feedback 

gain 

K 

P 

Ar 

(mm) 

4# 

(mrd ) 

Az 

(nra) 

(F  )max 
z 

(N) 

400 

51 

51 

51 

4762 

1000 

32 

32 

32 

1 1646 

2000 

22 

22 

22 

23029 

5000 

14 

14 

14 

56993 

10000 

10 

10 

10 

113403 

Convergence 
t  ime 

(s) 
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Figure  4.2 
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decreasing,  Kp  and  are  increased  as  long  as  Che  torque 

limits  are  not  exceeded. 

Using  this  technique  and  increasing  Kp,  K^  in  a  piecewise 

fashion  [34],  the  previous  hand  motion  of  the  three  joint 

manipulator  was  again  simulated.  The  maximum  allowable  torque 

(force)  was  set  to  be  5000  Nm  (N).  Simulation  results  are  given  in 

table  4.3  and  figures  4.3  -  4.5.  The  results  show  a  steady  state 

error  of  4  mm  for  joint  variables  r,  z  and  4  mrd  for  $  .  The 

convergence  time  of  the  motion  is  0.15  seconds.  Indeed,  this  time 

varying  technique  gives  an  appreciable  improvement  in  both  steady 

state  error  and  maximum  torque  requirements,  compared  to  when  we 

use  constant  feedbacks  K_  and  K 

P  v 

The  second  part  of  the  simulation  consisted  in  simulating 
the  end  effector  motion  of  the  three  joint  manipulator  along  an 
arbitrary  preplanned  path  in  joint  space,  which  is  given  in  table 
4.4.  The  time  varying  technique,  which  provides  a  better 
performance,  was  used  and  the  maximum  allowable  torque  (force)  was 
set  to  5000  Nn  (N)  .  The  required  time  of  motion  was  set  to  1.95 
seconds.  Simulation  results,  given  by  table  4.5,  show  a  relatively 
high  maximum  error  position  and  a  final  steady  state  error. 

To  summarize,  one  can  say  that  with  the  computed  torque 
technique  adequate  path  cracking  can  be  achieved  only  at  the 
expense  of  high  power  actuators.  Even  though,  the  time  varying 
technique  lowers  the  torque  requirements,  relatively  high  torques 
are  still  needed  for  acceptable  accuracy.  This  is  mainly  due  to 
the  fact  that  one  tries  to  stabilize  a  nonlinear  system  with 


Table  4.3.  Simulation  of  joint  motions  from  (1,0,-1)  to 

(0. 5,-0. 6,-0. 2)  with  time  varying  feedback  gains. 


THREE  JOINT  ROBOT  MAN  I  PULA TOR  -  COMPUTED  TORQUE  TECHNIQUE 
TIME  VARYING  FEEDBACK  GAINS. 


TIME 

R 

PHI 

Z 

V 

0 

1 . 000000 

0 

-  1  000000 

V 

k 

. 030000 

. 833972 

- . 177880 

-  731955 

' 

. 1 OOOOO 

. 564022 

-  . 523944 

-  . 266421 

. 130000 

. 304024 

- . 595973 

-  .  1 95992 

. 200000 

. 304014 

*  595986 

-  .  1 95985 

E 

. 230000 

. 304015 

- . 595983 

- . 1 95985 

■ 

. 300000 

. 504015 

- . 595985 

-  195983 

»*.  r»,r*. 


o'- 

*  V  -M  1 


Table  4.4.  Preplanned  path  in  joint  space 
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basically  a  linear  PD  feedback  law.  In  practice  equations  (3.2), 
on  which  the  convergence  process  relies,  are  only  approximately 
satisfied.  Hence,  it  is  expected  that  the  tracking  accuracy  is 
worse  chan  the  one  obtained  by  simulation. 

We  will  consnent  more  on  this  technique  when  comparing  it  to 
the  feedback  linearization  which  will  be  presented  in  the  nexc 
section . 

4.2  Feedback  Linearization 

The  control  of  a  n-joint  robot  manipulator  is  in  fact  Che 
problem  of  controlling  a  dynamic  system  described  by  a  set  of  n 
nonlinear  differential  equations.  Due  to  the  difficulty  of  the 
problem,  one  may  attempt  to  linearize  the  system  and  design  a 
linear  control  law. 

A  commonly  used  method  to  linearize  a  nonlinear  system  is 
the  first  order  linearization  (Taylor  expansion).  By  this  method; 
Golla,  Gang  and  Hughes  [5]  linearized  the  dynamic  equations  and 
designed  a  linear  state  feedback  controller.  However  as 
Vukobratovic  [35]  shows,  when  performance  requirements  (speed, 
accuracy,  etc.)  are  raised,  this  approach  does  not  lead  to 
satisfactory  results,  mainly  because  at  high  speeds,  higher  order 
terms  cannot  be  neglected. 

A  different  approach  to  accomplish  the  linearization  is  via 
nonlinear  feedbacks.  This  idea  has  been  used  in  the  past  years  and 
was  the  start  of  what  has  been  termed  'geometric  control  theory' . 
Precisely,  given  a  multi-input  nonlinear  system  of  the  form: 


„v 


X  * 


■V 


fi 


*4 
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X°(x) 


l  u.(x)X*(x) 


where  X°,X*,...,Xm  are  smooth  vector  fields  on  Rn  (or  a 

n-manifold  M  )  and  u.,...u  are  the  controls.  The  question  now 

1  rn 

is,  under  what  conditions  can  one  find  a  coordinate  transformation 
and  nonlinear  feedbacks  u.(x)  (i  ■  such  that  the 

nonlinear  system  is  linearizable .  Lately,  this  question  has  been 
answered  and  a  feedback  linearization  theorem  with  necessary  and 
sufficient  conditions  has  been  proved  [25]. 

In  this  section  we  will  present  this  feedback  linearization 
cheorem  and  use  it  to  derive  a  control  algorithm  with  explicit 
feedbacks  to  control  an  n-joint  robot  manipulator. 


4.2.1  Theoretical  background 

Lie  algebra  of  vector  fields  [36]  is  extensively  used  in 
geometric  control  theory.  In  this  section  we  give  some  definitions 
and  theorems  that  will  be  used  in  later  sections.  We  define  the 
Rronecker  indices  and  introduce  the  concept  of  nonlinear 
controllability. 


Definition  4.2.1 

The  Lie  product  of  two  smooth  vector  fields  X(x)  and 
Y(x)  is  defined  as: 

[X ,Y] (x)  -  X  (x)Y(x)  -  Y  (x)X(x) 

1  1  x  x 

where  X  and  Y  are  the  Jacobians  of  X  and  Y. 
x  x 


Definition  4.2.2 


Let  f  be  a  smooth  real  valued  function  of  a  manifold 


onto 


R 


1 


The  Lie  operator  L  is  defined  as: 


(L  f>(x) 


(Xf)(x) 


<X<x) ,df ( x ) > 


l  a . (x) 
r  i 


3  f  (x ) 
3x. 


i 


where  X(x)  *  (a,(x),...,a  (x))C. 

1  n 

for  notacional  convenience,  we  will  use  the  following  notations: 


(adX.Y )  -  [x,Y j ,  ( (ad°X,Y)  »  Y) 
(ad2X,Y)  -  [X,[X,Y]]  -  [X, (adX.Y)] 
(adkX,Y)  -  [X,(adk-1X,Y) ] 


Definition  4.2.3 

Let  f  be  a  smooth  real  valued  function  of  a  manifold 

1 

onto  R 

In  terms  of  operator  notion,  the  Lie  product  [x,Y]  can  also 
be  defined  as: 


[X  ,Y ] f  -  Y(x)(Xf)(x)  -  X(x) (Yf ) (x)  . 

Definition  4.2.4 

Let  G  be  a  finite  dimensional  vector  space.  If  the  Lie 
product  [x,Y]  defined  on  G  (X,Y€  G)  satisfies 

1.  [oXj  ♦  X2,  Yj  -  [«j,Y]  ♦  [X2,Y]  -  [xif«Y]  *  [X2,Y]  - 
®[xrY]  *  [X2,Y]  for  a  rea  l . 

2.  [x,y]  ■  -[y,x]  (ant icommutat ive) 

3.  [[X,Yj,zj  ♦  [ [ Y , Z  ]  ,X  ]  ♦  {[Z,X],Y]  -  0  (Jacobi  Identity) 


then  G  is  said  Co  be  a  Lie  algebra. 


Definition  4.2.5 

Civen  two  Lie  algebras  G^  and  G,, .  Given  also  a  one-to- 
one  mapping  t  from  G^  onto  G,  .  Then  t  is  said  to  be  a  Lie 
algebra  isomorphism  if  it  satisfies: 

1.  For  any  real  c^,  then 

l(c1Xl  ♦  c 2 X 2 )  -  CjKXj)  ♦  c2l(X2)  “  c1Yl  *  c2Y2 

2.  t[xrX2]  -  [  t(X1),t(X2)  j  ■  [Y,  ,Y2] 
where  Xj,  €G^  and  Yj,  ^2  '  ^2  ‘ 


If  l  is  also  differentiable,  it  is  said  to  be  a  Lie  algebra 
di f  feomorphism. 


Theorem  6,2.1 

If  t  is  a  Lie  algebra  di f feomorphism  from  a  Lie  algebra 

G,  onto  a  Lie  algebra  G  ,  Chen  Che  Jacobian  l  of  l  is  also 
1  l  * 

a  Lie  algebra  d i f feomorph i sm  from  G^  onto  G 2  with  ltlX,Yj  * 

[M-Vl  • 


Definition  4.2.6:  Involut iveness 

1  k 

Let  C  *  {x . X  f  be  a  set  of  smooth  vector  fields  on 

Rn  (or  Mn),  with  X 1 ( P) , . . . .xN P)  linearly  independent  for  some 
point  P  .  Then  the  set  C  is  said  to  be  involut  ive,  if  for  any 

X1  ,  X1  €C  there  exist  smooth,  real  valued  functions 

k  .  . 

such  that  the  Lie  product  [xl,X^](x)  ■  \  al^(x)X  (x)  ,  x  in  the 

l  91 

neighborhood  of  P,  i.e.  [x,y](x)  €  span  of  C  . 


A 


S'-/- 
*  *  a  '» 


. .VV*/.  V-  V-  ' 

•  3  la  *_a  *J|  •  »  ■  *>  ^  J  ’  •  a’j  •  »  *  a  *_» 


Definition  4.2.7.  Kronecker  indices 


as 


and 


The  Kronecker  indices  for  a  matrix  pair  (A, 3)  are  defined 
fol lows : 

Let  Rl  -  [ B,  AB,  .  .  .  ,  a'"!b] 

Let  tj  *  dim  R^  ■  dim  B 

l.  *  dim  Rl  -  dim  Rl  ^ 

i 


For  an  integer  j,  the  Kronecker  index  K.  equals  the  total  number 
of  l.  which  are  greater  than  or  equal  to  j,  i.e. 

K.  -  W  t.  1.  >  j} 
j  1  i  i  —  ‘ 

Example  4.2.1 


0 

0 

1 

0 

0 

0 

1 

0 

0 

0 

0 

1 

0 

0 

0 

1 

Let  A  - 

0 

0 

0 

0 

,  B  - 

1 

0 

Hence  AB  * 

0 

0 

0 

0 

0 

0 

0 

l 

0 

0 

lj  ”  dim  B  -  2 

R2  -  (B ,  AB ) ,  dim  R2  -  4 

t2  *  dim  R2  -  dim  R^  "4-2*2 

The  Kronecker  indices  are: 

Kl  -  {#  *ii*i  >  U  -  2. 

K2  *  {#  ».!t.  >  2}  -  2. 

K .  *  0  for  i  >  3 . 


Definition  4.2.8.  Local  controllability  of  a  nonlinear  system 
Given  the  multi-input  nonlinear  system  of  the  form 

m 

x  -  X°(x)  *  l  u. (x)Xl(x)  (4.1  ) 

i*  1  1 

where  X^,X^,...,xm  are  smooth  vector  fields  on  Mn .  Let  P  be 
the  equilibrium  point  (corresponding  to  u^  =  0)  so  X^(P)  =  0  . 

Then  system  (4.1)  is  said  to  be  locally  controllable  about  P,  if 
for  any  time  t  >  0  there  exists  a  control  u  such  that  any  point 
in  a  full  neighborhood  Rn  of  P  can  be  reached  in  time  t  by 
solutions  initiating  from  P. 

Theorem  4.2.2 

A  first  order,  sufficient  test  for  local  controllability  of 
system  (4.1)  along  its  equilibrium  point  P  is: 

dim  spanj (adJX°,Xl)(P):  i  *  l , . . . ,m;  j  *  0,1,..,}  3  n 


For  a  single  input  nonlinear  system,  this  condition  reduces 


to: 


dim  span}  (ad_1X^,X* ) ,  j  ■  0,...,n-l}  being  linearly  independent. 


The  proof  can  be  found  in  [37]. 

4.2.2  Feedback  linearization  -  single  input  case 

In  this  section  we  define  precisely  the  concept  of  feedback 
l inear  izat ion  and  state  the  single  input  feedback  1  inear izat ion 
theorem. 

Given  the  single  input  nonlinear  system  of  the  form: 


x  *  X^(x)  *  u(x)X*(x) 


(4.2) 


where  X  ,X  are  smooth  vector  fields  on  R  (or  M  ),  u  is  t 


control  and  P  is  the  point  along  which  we  want  to  linearize. 
Given  also  the  linear  time  invariant  canonical  form: 


y  *  Ay  ♦  by  ,  A 


1 — 

o 

o 

o 

o 

1 _ 

“o“ 

0  0  1  0  ...  0 

• 

,  b  3 

0  ...  01 

0 

_0  ...  0. 

_1__ 

where  u  is  the  control. 

.0,  ,  _  .  .  „1 


(4. 


Let  V  (y)  *  Ay  and  V  (y)  =■  b  ,  be  vector  fields  on  R  ,  i.e.: 

(4. 


V°  * 


The  feedback 
When 


linearization  problem  can  be  expressed  as  follows: 
can  one  choose  a  coordinate  transformation 


y  *  <Kx)  , 


(4. 


where  is  a  local  dif feoraorphism  (differentiable  isomorphism), 
with  $(P)  *  0  ,  $^(P)  nonsingular  and  a  nonlinear  feedback 

control 

u(x)  *  v(x)  +  w(x)u  (4. 

where  u  is  a  free  new  control,  such  that  (4.5)  and  (4.6) 
transform  the  nonlinear  system  (4.2)  into  the  linear  system  (4.3) 
Specifically,  with  feedback  alone,  system  (4.2)  transforms  to: 

x  *  X^(x)  ♦  (v(x)  ♦  u(x) u)X ^ (x) 

x  -  W°(x)  ♦  yWl(x) 


v.e. , 


(4. 
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where  W^Cx)  *  X^(x)  +  v(x)  Xl(x)  and  Wl(x)  a  u(x)X^(x)  are 
smooch  vector  fields. 

While  with  both  coordinate  change  and  feedback,  we  have: 
y  *  $*(x)x  ,  is  the  Jacobian  of  $  . 

y  -  <^(x)(X^(x)  +  (v(x)  +  oj(x)  u)X^  (x) ) 

i.e. :  y  *  Y°(y)  +  uYl(y)  (4.8) 

where  Y^(y)  *  $*(x)(X^(x)  +  v(x)X^(x))  =  ^(x)W^(x)  ,  which  we 
want  to  be  V^(y)  , 

Y^(y)  a  ^(x)(u(x)x'(x))  a  $+(x)W*(x)  , 
which  we  want  to  be  V^y). 

We  note  that  V°  and  V1  generate  a  Lie  algebra  L(V^,Vl). 
Suppose  that  system  (4.8)  is  the  linear  canonical  form  (4.3). 

Since  $  is  a  di f feomorphi sm ,  then  by  theorem  (4.1.1),  b,y  is  a 
Lie  algebra  dif feomorphisra  too,  i.e.  cannot  change  the 

structure  of  a  Lie  Algebra.  Thus  the  Lie  algebra  L(W^*  ,W^  ) 
generated  by  W^  and  must  be  isomorphic  to  the  Lie  algebra 

L(V^,V  ).  In  other  terras,  it  is  necessary  to  be  able  to  choose 
v(x)  and  w(x)  to  have  such  \P  and  W* . 

Next  we  introduce  the  concept  of  feedback  equivalence  of 
two  systems. 

Def in  it  ion  (4,2.1) 

Two  systems  are  said  to  be  feedback  equivalent  if  one  can 
be  transformed  into  Che  other  via  a  local  coordinate  change  and  a 


feedback . 


Theorem  4.2.3 


The  n-dimensional ,  linear  time  invariant  systems  x  =  Ax  * 

Bu  and  y  =  Cy  +  Du  where  A,C  are  n*n  and  B,D  are  n*ra 

matrices;  are  feedback  equivalent  if  and  only  if  the  pairs  (A,B) 

and  (C,D)  have  the  same  Kronecker  indices. 

Furthermore  if  (A,B)  has  Kronecker  indices  K^  >.  K^  >_  ... 

K  ,  then  it  is  feedback  equivalent  to  the  following  canonical 
m 

.  c  c 

form,  z  =  A  z  +  B  u  ,  u  input  vector, 
c  •  • 

and  A  is  the  Jordan  block  diagonal  matrix  given  by 


AC  -  diag  ( A : . ,Am)  ,  At 


0  1  0 


0 

l 

0 


A^  being  a  (K^XK.)  matrix  with  ones  in  the  superdiagonal  and 
zeros  elsewhere.  BC  3  (b^,...,bm)  ,  b^  ■  [0  ...  0  1  0  ...  0 J C 


where 


is  a  (n*l)  vector  with  1  in  the  (  £  K.)  component 


and 


zeros  elsewhere. 

The  proof  can  be  found  in  [38]  . 

The  following  theorem  for  the  single  input  linearization 
problem  gives  necessary  and  sufficient  conditions  for  the  existence 
of  the  diffeomorphisra  $  and  the  feedback  control  u  .  It  also 
gives  the  explicit  form  of  $  and  u  . 


Theorem  4.2.4  -  Single  input  feedback  linearization  theorem 


Given  the  single  input  nonlinear  system: 


x  =  X^(x)  +  u(x)X^(x) 


(4.9) 


where  X^,  x''  are  smooth  vector  fields  on  Mn,  u  is  the  control 
and  P  is  the  point  about  which  the  linearization  is  desired. 


Let  y  =  Ay  +  bp 


(4.10 


0  10  0 


0  0  10 


with  A 


Lo  ...  oj  LU 

be  a  canonical  linear  time  invariant  system  with  control  u  . 

Then  three  necessary  and  sufficient  conditions  for  system 

(4.9)  to  be  feedback  equivalent  to  the  canonical  form  (4.10)  are: 

i)  there  exist  a  real  a  such  that: 

X°(P)  +  oXl(P)  =*  0  . 

ii)  X^(P)  ,  (adX^ ,X ^ ) (P) , . . . , (adn  *X^,X*)(P)  are  linearly 
independent  (sufficient  local  controllability  test, 
theorem  4.2.2). 

iii)  the  set  {x\  (adX°,Xl) . (adn‘2X°  ,Xl) }  is 

involut ive . 

Furthermore,  when  the  above  three  conditions  are  satisfied  and 


since  the  involutive  condition  3  implies  that  there  exist  a  smooth 
function  h:  Rn  ♦  R^  such  that  h(P)  ~  0  .  (dh)(P)  *  0  and 


(ad JX  ,X  )h  -  <(adJX  ,X  ),  dh>  =  0  for  j  =  0,1 . n-2.  Then  the 

dif feomorphic  coordinate  change  y  =  $(x)  with  $(P)  =  0  and 

$A(P)  non-singular  is  given  by: 

y.  =  (L^'hXx)  ;  j  =  1 . n 

J  xu 

and  the  feedback  u(x)  a  v(x)  +  u(x)u  ,  where  u  is  a  new  free 
control  (of  (4.10))  is  given  by: 


(Lu  h)(x) 

X  _ 

(L  Ln~lh)(x) 

xL  xu 


(L  Ln"lh)(x) 
X  X 


Proof . 


The  proof  of  this  theorem  is  constructive,  i.e.  in  the 


process  of  proving  the  three  necessary  and  sufficient  conditions, 
the  coordinate  transform  and  the  feedback  u  are  constructed. 


Necess  arv  part 


i)  We  have  V°(y)  a  Ay  ,  so  V°(0)  a  0  .  Since  is  a 

Lie  algebra  dif feomorphism,  it  is  necessary  that  W^(P)  =*  X^(P)  + 
v(P)X*(P)  be  zero.  Hence  it  is  necessary  that  there  exist  a  real 
a  such  that  X^(P)  +  aX^(P)  *  0  . 

ii)  We  note  that  V^O),  (adV°,Vl)(0) . (ad"' 1V°  ,V  1 )  ( 0) 

are  linearly  independent  (i.e.,  the  linear  controllability 
condition  is  satisfied).  Again,  since  is  a  Lie  algebra 

isomorphism,  it  is  necessary  that  W*(P) . (ad"  ^W^ ,W  )(P)  are 

linearly  independent  since  L(W^ ,W^ )  must  be  isomorphic  to 


•  IW  |l .'‘i*.  I>«  il.  lk.  ll.  iU  111  4U  1^4*.  it.  il.'>U**l,  l<4‘|U  | 


WWlUfUVWl  v  jrwi  *VT 


L(V°  ,V  ^ )  .  But,  (adW°  ,W^ )  =  [X°  +  vX1,  wX1]  =  [x° ,  toX 1  ]  +  [vX1,^1] 

-  <*)[x°  ,X 1  ]  •*■  (X°<u)X1  +  v(X1ui)X1  -  ii)(XLv)Xl  ,  and  since  W1  =  wX 1  , 
then  W^(P),  [w^W1  j  (P)  are  independent  if  and  only  if  X^(P), 

[x° ,X 1 ] ( P)  are  independent. 

2  0  1 

Continuing  with  (ad  W  ,W  )  etc.,  we  obtain  the  second 
necessary  condition: 

Xl(P)  ,  (adX°  .X1  HP) . (adn~  1X°  ,Xl)(P) 

are  linearly  independent.  This  condition  is  sufficient  for  system 
(.it. 9)  to  be  locally  controllable  at  ?.  However  we  can  have  system 
(4.9)  locally  controllable  at  P  but  not  have  the  second  condition 
of  the  theorem  satisfied.  In  this  case  we  cannot  transform  via 
feedback  and  a  coordinate  change  to  the  linear  system  (4.10). 


(iii)  At  this  point,  one  should  note  that  for  the  linear 


system  (4.10),  (adnV°  .V1  )  *  0  .  Also  [  (adJVU  .V1  ) ,  (adV,VL)]  *  0 


v1 


,l..O  ..I 


for  all  j,  l  .  Again,  we  must  require  this  for  the  similar 
products  of  .  Indeed,  computing  in  the  linear  system  shows 

{ V  ,  (adV^.V^) , . . . , (adn  ^V^,V^)}  is  an  involutive  set.  Thus  again 
since  the  Lie  algebras  L(W^,W^),  L(V^,vb  are  isomorphic,  we 
require  that  {w^,...,(adn  ^W^,W^)}  be  involutive.  But,  (adW^.W*) 
-  wfx^X1]  ♦  (X^X1  +  v(XIut)XI  -  wtX^X1  .  So  as  for  the  second 
necessary  condition,  we  conclude  that  it  is  necessary  that: 

fx^ . (ad°  ^X^.X^)}  is  involutive. 


t"r 


Sufficient  pare. 

We  want  to  show  that  when  the  three  conditions  are 

satisfied,  there  exist  a  diffeomorphisra  $  and  a  feedback  u  such 

that  systems  (4.9)  and  (4.10)  are  feedback  equivalent.  We  will 

also  give  the  explicit  form  of  and  u. 

By  necessary  condition  1,  there  is  an  a  such  that  X^(P) 

+■  aX^(P)  =  0  .  One  can  rewrite  system  (4.9)  as:  x  =  (X*"*(x)  + 

aX^(x))  +  (u(x)-a)X^ (x)  ,  i.e.  x  *  X^(x)  +  u(x)X^ (x)  ,  where 

X^(x)  *  X^*(x)  +  aX^(x)  vanishes  at  P  and  u(x)  *  (u(x)-o)  is  a 

new  control.  Thus  we  can  and  will  assume  that  X^(P)  =  0  . 

The  involutive  necessary  condition  3  implies  there  exists  a 

smooth  function  h:  Rn  *  R^  ,  with  (dh)(x)  *  0,  such  that: 

(ad^X° ,X 1  )h  -  0  j  *  0, . . . ,n-2  .  (4.11) 

We  will  also  choose  h  so: 

h(P)  =■  0  ,  (dh)(P)  *  0  . 

Now  we  let  the  coordinate  change  $  be: 

y.  ■  (iJ  *h)(x)  j  »  1 , . . . , n  (4.12) 

J  X 

with  y  -  (y1 , . . . ,yn)  -  $(x)  . 

We  claim  that  the  map  $  satisfies  #(P)  »  0  and  $+(P) 


is  non-singular,  i.e.  $  is  a  local  dif feomorphism. 


a)  Indeed  we  have  already  chosen  h  such  chat  h(P)  =  0  i . e . , 


y^P)  -  0  .  Also,  y2(P)  =*  (L  Qh)(P)  =  <X°  ( P)  ,  dh  ( P)  >  =  0  , 


since  X°(P)  =*  0  .  Similarly  y.(P)  =  (L^"lh)(P)  =■  0 


j  *  2 . n  ;  because  X  (P)  *  0  .  Hence  $(P)  *  0  . 


b)  We  wish  co  show  chat  $  (P)  is  non-singular,  i.e.  dh(P), 


(dL  _h)(P) , . . . , (dL  n)(P)  are  linearlv  independent. 
X°  X°  ’ 


We  already  have  dh(P)  *  0  .  Suppose  (dh)(P)  and 


(dL  h)(P)  are  dependent,  i.e.  ^  a  *  0  such  that 


(dL  -h)  (?)  =•  a(dh)(P) 
X° 


Then,  (adX°  ,X 1  )h(  P)  -  [x°,X1]h(P)  =*  X1(X°h)(P)  -  X°(X1h)(P)  = 


X1(X°h)(P)  *  OC1  ,dL  nh>(P)  =■  <X1,adh>(P)  -  a(XIh)(P)  which  means 
X 


that  X*(P)  and  (adX°,X^)(P)  are  dependent.  Inductively  if 


(dLJnh)(P)  is  a  linear  combination  of  (dLl-h)(P)  i  *  0,...,j-l  ; 


we  conclude  that  (ad^+ ^X^ ,X ^ ) ( P)  is  a  linear  combination  of 
(ad ^X^.X*) (P)  0  <  i  <  j  ,  which  contradicts  the  independence  of 


X*(P) . (ad°  ^X^,X^)(P)  of  necessary  condition  2. 


Hence  $^(P)  is  non-singular  and  <p  is  a  local 


dif feomorphism  given  by  (4.22).  Next,  we  construct  the  feedback 


To  shorten  notation,  let 


V(x)  *  X^(x)  +  u(x)X^(x)  *  x 


^  jf*  a  j  ~  f » **  m  ^  m  *. .  * t  y»  * 


First  we  wish  to  show  that 


(Ljh)(x)  -  (Lj  h)(x)  ,  j  -  1 , . . . , n- 1 

V 


and 


L  L  h  -  0  j  *  1 , . . . ,n-2 

V  X 


For  j  *  1 


L.h  -  (X°  +  uX1) h  =  X°h  =■  L  .h 

*  xu 


For  j  =•  2 

L^h  -  L^h  -  (X°  +  uXL)L  h  =  L2  h  +  uL  L  h 

x  X  X  X 

Again  from  (4.11) 


(adX°,XI)h  -  X2(X°h)  -  X°(X1h)  -  X1  (X°h)  -  L  L  _h  -  0 

x1  xu 


Hence 


2  2 
V  "  \oh  ’ 

A 

Continuing  in  this  fashion  and  as  long  as  (ad^  2X^,X2)h  3  C 
which  is  valid  for  1  <_  j  n-1  we  have 

I^h  =«  L2Qh  1  I  j  1  n-1 

X 

Since  X2 (P) f . . . , (adn_ 2X^ ,X2 ) (P)  are  independent  and  X^(P) 
it  follows  that  XX(P)  #  0.  We  have  (adX°,Xl)h  -  X^X^)  - 
X^CX^h)  -  X1(X°h)  *  L  . L  *  0  .  Inductively,  from 

X* 

(ad^X^.xSh  ■  0  for  j  =  l,...,n-2  we  have 


E 


to  make  the  last  equation  become  y 
the  linear  system  (4.10). 


,  and  system  (4.15)  is 


Remark. 

In  many  applications  the  linearization  is  to  be  done 
around  the  equilibrium  point  corresponding  to  zero  controls.  In 
this  case  P  is  such  that  X^(P)  *  0  and  the  first  condition 
of  the  theorem  becomes  trivial  (a  =  0). 

4.2.3  Feedback  equivalence  -  multi-input  case 

In  this  part  we  give  the  multi-input  feedback  lineariza¬ 
tion  theorem  which  is  an  extension  of  the  single  input  one.  We 
will  show  that  when  the  linearization  works,  the  nonlinear  system 
is  feedback  equivalent  to  a  canonical  linear  system  determined 
by  the  Kronecker  indices  of  the  nonlinear  system. 

Theorem  4.2.5.  Multi-input  feedback  linearization  theorem 

Consider  the  multi-input  nonlinear  system  of  the  form: 


Let  P  be  the  point  around  which  we  wish  to  linearize. 

The  necessary  and  sufficient  conditions  for  system  (4.16), 
with  Kronecker  indices  K,  >  K.,  >  ...  >  K  ,  to  be  feedback  equi- 

1  —  4  —  —  d 

c  c 

valent  to  the  linear  time  invariant  canonical  form  i  =  A  z  +  B  u 
as  given  by  theorem  (4.2.3)  are: 

1  -  There  exist  reals  a  a  so 


X°(P)  ♦  l  a.x'(P)  -  0 


K  -  1  K  -  l 

2  -  Let  C (x )  -  {x1 . (ad  1  X°,X!),X2 . 'ad  2  X°,X2) 

K  -1 

.  . . .X®, . . . , (ad  m  X°  .X®)}  .  Then  dim  C(P)  -  n  (local 
controllability  test). 

3  -  For  each  i  *  let 

i  K;~2  n  i  ■>  K  -2  n  , 

C.  -  (X  . (ad  X  ,X  ),X‘ . (ad  1  X  ,X‘) . X®, 


K.-2  - 

. . . i (ad  1  X°,X®)} 


Then  C.  must  be  involutive  and  Span  C.(x)  a 

i  t 

Span  (C.  fl  C)(x)  i.e.,  C.  (x)  c:  C(x)  for  all  i. 

i  i 

The  proof  of  this  theorem  is  similar  to  the  one  given  for 
Che  single  input  case  and  can  be  found  in  [25], 


Remark: 


As  for  Che  single  input  theorem  when  the  point  of  interest 


P  is  the  equilibrium  point  (in  most  applications  it  is)  such  that 
X^(P)  ■  0,  the  first  condition  of  the  theorem  becomes  trivial 
(a.  *  0,  for  all  i). 

In  the  single  input  case  the  construction  of  the  diffeo- 
raorphism  $  and  the  feedback  control  u  from  the  real  valued 
function  h  was  proved.  But  for  the  multi-input  case  the  con¬ 
struction  work  has  not  been  proved,  we  rather  guess  the  functions 
h.(x)  as  it  will  be  shown  for  the  n-joint  robot  manipulator. 


A  special  case  of  interest  is  when  the  Kronecker  indices 


are  all  less  or  equal  to  two.  In  this  case  the  third  condition 


I  2  m . 

is  always  true.  Indeed,  »  •  X  ,X“ . X  '  which  is  an 


involutive  set  and  c  C  for  all  i. 


4.2.4  Application  to  robotics 


In  this  section  we  will  design  a  nonlinear  controller  for 


robot  manipulators  by  employing  the  feedback  linearization 


technique.  We  will  first  show  that  the  feedback  linearization 


theorem  is  applicable  to  linearize  the  dynamic  equations  of  a 


n-joint  robot  manipulator.  Then,  we  will  derive  a  aenerai 


expression  of  the  feedback  controls.  We  will  illustrate  this 


design  procedure  by  two  examples,  a  two  joint  and  three  joint 


manipulator.  Computer  simulations  for  performance  evaluation  are 


also  given. 


In  chapter  III  we  have  seen  that  the  dynamic  equations 


of  a  n-joint  manipulator,  as  given  by  (3.1),  have  the  following 


D  (q )  q  +  H  (q , q  )  +  G(q) 


(4.17) 


where  q  **  (q^,...,q  )  is  a  vector  of  the  actual  joint  posit) 


If  we  let 


q  -  x 


i  2  i  —  l 


qL  "  x2i  1  “  1 . n 


then,  the  state  space  representation  of  (4.17)  can  be  written  as 


*«■  s 


ft(x) 


Uj (x)  +  0  u,  (x)  + 


a  .  Cx) 
1 


f  (x) 

-  n  - 


+  u  (x) 

a 


t  4  .  i  3 ) 


where  u.(x)  concains  a  nonlinear  combination  of  some  torques  t.. 

1  J 

We  will  assume  that  f.(x)  is  a  smooth  real  function,  for  all  i. 

i 

System  (4.18)  is  then  a  muiti-input  nonlinear  system  of  the 

form : 


0  i 

x“X(x)+  i  X  (x)u,  fx) 


where  X^;  X1,  i  *  1  ,  .  .  .  ,n  are  smooth  vector  fields  as  given  by 


(4.18). 


Let  P  be  the  equilibrium  point  of  (4.18),  corresponding 


to  u .  *  0 ,  i . e . 

i 


X  (P)  -  0  . 


Next  we  show  that  this  state  representation  leads  to 
Kronecker  indices  all  less  than  or  equal  to  2. 
i)  Kronecker  indices: 


R1  -  span  [X1 (P) , . . . ,Xn(P) ]  =  span  0 


l,  »  din  R  9  n  . 


R2  -  span  [X[(P) , (adX° .X1 )(P) , . . . ,Xn(P) , (adX°,Xn) (P) ] 


X° ( x )  - 

X 


0  i  0  0 


»*l  3x2 


0  0  0  10 


3f,  3f, 

_ 2  _ 2 

3Xj  3x2 


►  "  •**'••«  '  l»  *  a  •  •  *  «  *  I  *  i  ’  »  'I,  "  «  *  i  •  k  ■’V  »''■  .*•  V  •  ,  *  *,  \  *  \  *,  *»*  \ 


1 2  3  dim  R  -  dim  R  =  n 
l ■  -  0  for  i  >  3  . 

t  — 

The  Kronecker  indices  are  then: 


-*  l. 

1.  >  1; 

1 

l  — 

{#  i. 

l.  >  2} 

2 

■} 


K  =  {#  L.  1.  >  n }  =  2 
n  i  i  — 

K .  =  0  for  i  >  n  . 

i 

Hence  all  Kronecker  indices  are  alL  less  than  or  equal  to 
2.  Next  we  check  the  three  necessary  and  sufficient  conditions  of 
the  feedback  linearization  theorem. 

Since  we  are  interested  in  linearizing  around  the 

g 

equilibrium  point  P,  with  X  (P)  =  0,  the  first  condition  is 
trivia l . 

Also,  due  to  the  fact  that  all  Kronecker  indices  are  less 
than  or  equal  to  2,  the  third  condition  is  immediate. 

For  the  second  condition  let 

C(x)  =*  {X1  ,(adX°  .X1  ),  .  .  .  ,Xn,(adX°  ,Xn)} 

dim  C(P)  “  dim  R2  =  2n 


Hence,  the  nonlinear  system  (4.18)  is  locally  controllable  about 


Thus,  the  feedback  linearization  theorem  is  applicable  and 
the  nonlinear  system  (4.18),  which  is  the  representation  of  a  robot 


manipulator  dynamic  equations, 
following  linear  time  invariant 


which  consists  of  n  uncoupled 
form: 


*1 

*2 

*2n-l 

*2n 

where  u.'s  are  the  controls. 


s  feedback  equivalent  to  the 
canonical  system: 


second  order  subsystems  of  the 


U 

n 


ii)  Synthesis  of  the  feedback  controls. 

As  mentioned  previously,  the  construction  of  the  coordinate 
change  $  and  the  feedback  control  u  is  not  proved  for  the 
multi-input  feedback  linearization  theorem.  Proceeding  as  for  the 
single  input  case,  we  will  'guess'  n  real  functions  h^ , . . . ,h  : 

Rn  ♦  R*  ,  such  that 

*  h^(P)  =*  0  i  =  1, . .  .  ,n 

*  dh  (P),...,dh  (P)  are  linearly  independent. 

*  (adJX^,Xl)h  =0  'f  i,j,m  i.e.: 

in 

L  . h.  =  <Xl ,dh . >  ■  0  V  i,  j  . 

X1  J  J 

Choosing  h^ . hn  to  meet  these  three  conditions  can  be  done  by 

inspecting  the  vector  fields  X°,...,Xn  as  given  by  (4.18). 

One  may  choose 

h^  (x)  *  Xj 
h^  (x)  *  x^ 


h .  (x) 


i-1 


h  (x) 
n 


x 


2n- 1 


Indeed 


*  h, (0)  *  ...  -  h  (0)  =0 

*  dhj(0)  =*  (1,0,...,0)C,  dh  2  ( 0)  =  (0 ,0 , 1 ,  .  .  .  ,  0)  C  ....dh^O) 
*  (0,...,l,0)t  are  linearly  independent. 

*  L  .  h .  *  <Xl,dh.>  *  0  V  i,i  . 

xl  J  J 

The  coordinate  change  is  chosen  as: 
yl  "  hl(x)  "  X1 

y2  "  <L  0hl)(x)  "  <x°.dh1>  -  <X°,(l,0,...,0)t>  «  x2 
y3  =  h  (x)  =*  x3 

y4  “  (L  0h2)(x)  *  <X°,dh2>  =  x4 

:  (4.20) 

y2.-l  -  H.U)  -  Xj._, 

?2i  '  '  <X°.dhi>  ■  *2i 


y.  -  h  (x)  ■  x_  . 
'2n-l  n  2n-l 


y2n  “  (L  Ohn)(x)  "  ^  ’‘V  3  X2n 


Equations  (4.20)  define  the  coordinate  transfom  y  =  $(x)  *  lx 
(I  *  Identity),  with  $(0)  =  0  and  $^(0)  a  I  • 

The  coordinate  transform  is  then  the  identity,  i.e.  no 


explicit  coordinate  change  is  needed  and  the  linearization  will  be 
accomplished  by  feedback  alone. 


At  this  point  we  note  that  if  we  choose  the  state 


representation  (4.18)  such  that  f^(x)  ■  0  for  all  i,  we  get 
exactly  system  (4.19)  with  controls  u.'s  instead  of  u.'s. 

i  i 

Since  $(x)  =  lx  ,  obviously  y  =  x  .  But  for  the  purpose 
of  showing  the  feedback  synthesis  for  any  d i f f eomorph i sm  $(x)  we 
will  continue  the  construcion  work  as  follows: 


Differentiating  (4.20)  gives: 


=*  dh  1  *x  =■  <X°  +  uLXl  +  ...  +  u^x",  dh^ 

*  <x0'dV  ’  Lv0hl  ■  *2 

A 


y2  =■  dL  q  h  j  •  x  -  <X°  +  UjX1  +  ...  ♦  uXn,  dL  Qh1> 
X  X 

=*  L2  h  +  u  L  .  L  ,h  +  ...  ♦  u  L  L  h 
X°  1  1  X1  X°  1  Vx01 


y2i-l 


dh  .  *x  *  <X°  ♦  u.X1  +  ...  +  u  Xn,  dh . >  (4.21 

i  L  n  i 

<x°’dhi>  *  t-xoh;  *  x2i 


dL  0h.«x  *  <X°  +  u.X1  +  ...  ♦  u  X",  dL  „h.> 
X°  1  1  n  x°  1 

2 

Lnh.  +u,  L.Lnh.  *■  ...  ♦  u.  L  .  L  -h.  +  ... 
X°  1  1  X1  X°  1  1  X1  X°  1 

♦  u  L  L  _h  .  . 
n  xn  X°  1 


y„  .  *  dh  *x  ■  <X^  +  u.  X1  *  ...  *  u  x" ,  dh  > 
'2n-l  n  1  n  n 


<X  ,dh  >  ■  L  rth  -  y_ 
n  ^0  n  x2n 


y.  -  dL  nh  »x  »  <X°  ♦  u.X1  ♦  . . .  ♦  u  X° ,  dL  nh  > 
2n  ^0  n  1  n  ^0  n 

2 

*L_h  ♦  u.L  ,L  rth  +  ...  +  u  L  L  nh 
On  l  ..  I  ..0  n  n  ..n  0  n 


with 


L/' 


L>- 


L  I  L  0  hI 
X  X 


L  iL  ohi 

X  X 


<X°,dh1>  =■  x2 

<X°,dL  0hL>  »  <X°,(0,l,0,...,0)t>  -  f 
X 

<xl,  dL  h  >  =  <X[, (0,1,0,.. .,0)t>  =  1 
xu  1 

<xl,  dL  Qh j>  *  <xl,(0,l,0,...,0)t>  = 0 


L  Qh.  =■  <X°,dh.>  *  <X°,(0,...  ,0,1,0 . 0)C>  = 


21-1 


L^h.  -  <X°,dL^0h.>  -  <X°,(0,...>0,)l,01...,0)t 


2  i 


L  .L  0h  -  <Xl,dL  h  >  -  <Xl,(0 . 0,1,0 . 0)C 

X1  X°  1  X°  I 


2  i 


L  .  L  _h.  ■  0  Vi#j  i-1 . n 

XJ  X°  1 


L 

X 


0hn 


L 

X 


L 

n  x° 


n 


L  .  L 
xl  X 


0  n 


<X^,dh  >  *  x, 
n  2n 

<Xn,dL  nh  >  -  <Xn,(0 . 0,l)t>-  I 

x°  n 

0  i  ■  1 . n-1  . 


Hence  (4.21)  becomes: 


i  *  i'i t  ir  i 


y_  .  3  f  .  (x)  +  u  .  (x)  =  u  • 
J  2 1  1  t  i 


and  the  explicit  feedbacks  u^(x)  are  given  by: 


u  .  (x)  *  -  f.(x)  +  u. 

i  i  i. 


where  u.  is  a  new  free  control. 


(4.23) 


The  control  problem  of  the  robot  manipulator  is  now  reduced 
to  controlling  the  n  linear  canonical  and  uncoupled  subsystems 


(4.19). 


The  objective  of  the  control  is  to  move  the  robot  hand  from 


one  path  point  to  another  desired  one,  or  through  the  inverse 

kinematics,  from  one  set  of  initial  joint  positions 

x,,x-,...,x-  ,  to  the  desired  joint  positions  w,,w,,...,w  ,  . 

13  2n-l  r  13  n-1 

Equivalently,  via  the  coordinate  transform  (4.20),  this  is  the 

problem  of  moving  from  the  states  y, ,y,,...,y-  (of  canonical 

L  J  <!n-l 

form  (4.19))  to  desired  states  ,  Zj , , . . . , z2n_ j • 

Subsystems  of  (4.19)  are  in  this  form: 


y2i-l  "  ^2i 


y2i  *  vi  1  m  1 . n 


(4.24) 


which  is  a  double  integrator,  type  two  system  with  transfer 


function: 


y2i-l(s)  1 


c 


System  (4.24)  can  be  stabilized  by  a  conventional  state  feedback 


Let 


U  • 
i 


V i  "  K2 i-l y2 i-1  "  K2iy2i 


1 , . .  .  ,n 


i .  e  . : 


with  transfer  function 


y2i-l(s> 

v  .  ( s) 
l 


s  *K2i—K2i., 


.  To  make  the 


steady  state  response  y^.  ^(t)  *5e  c^e  desired  z^  let 

v^(s)  =  L  • z2^_^ -u(s)  where  u(s)  is  a  step  input.  Hence, 


y» .  . ( t)  L .  — 

,  •  '2i-l  ,  .  is 

lun  -  »  hi  s 


t+~  Z2i-l  s-K)  s2>K..s+K„.  .  K2  i- 1 

2  v  2 i-l 


ft 


Hence  y^^C*)  ”  Z2t-l  Li  *  K2i-1  '  Therefore  the  control 


is: 


Ui  *  K2i-l(z2i-l  ‘  *2 i-l5  “  *2iy2i  1  "  1 . n  '  (4‘25) 


We  have  shown  that  the  original  system  (4.18)  and  the 


linear  canonical  form  (4.19)  are  feedback  equivalent.  This 
feedback  equivalence  implies  that  both  systems  have  the  same 
dynamic  behavior.  Again,  in  most  robot  applications,  overshoot  is 
to  be  avoided.  To  ensure  the  fastest  response  with  no  overshoot, 
the  feedback  gains  K^i’  ^2i  1  are  ctl0sen  ^or  a  cr'-c^caliy  damped 
response  of  the  second  order  subsystems  (4.19). 

By  the  coordinate  transform  (4.20)  we  can  express  u ^  as: 


K2i-l(“2i-l 


‘  «2l-l)  - 


K. . x„  . 
2 1  2  l 


Thus,  the  explicit  feedbacks  u^(x)  are: 


u .  (x) 

i 


-  f.(x) 

i 


K2i-l(w2i-l 


-  X2  i—  1 5  ‘ 


K_ . x_  . 

2  i  2 1 


(4.26) 


where  f^(x)  is  given  by  (4.18) 

K2i’K2i  1  are  ^ee^^aci<  gains 

w^._j  is  the  desired  joint  position,  i  *  l,...,n 


The  actual  concrol  torques  T  ^  (i  *  1 . n)  to  be  applied  by  the 

actuators  are  obtained  by  solving  for  T ^ ,  the  set  of  equations 
u.(x)  ;  i  *  1 , . . . , n  . 

Note  that  we  assume  that  the  states  x^ . x^  (joint 

positions  and  velocities)  are  all  measurable. 


Here,  we  summarize  the  procedure  we  went  through  to  derive 


the  feedback  controls. 


First,  we  obtained  the  state  space  representation  (4.13) 


from  the  dynamic  equations  of  an  n-joint  robot  manipulator.  For  a 
six-joint  manipulator,  the  expressions  of  u. (x)  and  f.(x)  can 

i  i 

contain  hundreds  of  algebraic  terras.  In  spite  of  the  complexity  of 
the  robot  dynamic  equations,  state  representation  (4.18)  led  to 
Kronecker  indices  all  less  or  equal  to  two.  This  made  the 
multi-input  feedback  l  inear  izat ion  theorem  readily  applicable.  The 
coordinate  transform  was  found  to  be  the  identity  and,  explicit 
nonlinear  feedbacks  were  derived. 

Next,  we  will  illustrate  this  procedure  by  two  examples,  a 
two  and  three  joint  manipulator.  For  these  two  examples,  a 
computer  simulation  study  is  conducted  to  evalute  the  performance 
of  the  nonlinear  controller.  We  will  also  compare  with  the 
'computed  torque’  technique  used  in  the  previous  section  (4.1). 


Example  4.2.2:  Two  joint  manipulator 

From  the  dynamic  equations  given  by  (3.6)  and  (3.7),  one 


notices  that  due  to  the  coupling  effect,  3^  and  8^  are  present  in 


both  equations.  In  order  to  derive  the  state  space  representat ion 


we  first  solve  for  8^  and  8^ .  Then  we  obtain 


a2^X^l*2  . 2 

- -  ( 8^  sin  8^  ♦  g  cosOj+e^) 


*2  (x)-a  j  (x)b^2 


b2l2 


a. (x)-a . (x)b  l 


(-b  l  l  sin  8  (28  +8  )8 
2  ° 1  l  2s 2V  l  22 


( 


.V 


CC 


5; 


f 


I 


64 


X4  "  “2 


Hence  the  state  space  representation  is: 


X1  *  x2 


=  f  ^  (x)  +  Uj (x) 


(4. 3  L ) 


X3  =  X4 


x^  =  f.,(x)  +  u0(x) 


where 


f  L(x) 


a2(x) 


a2 (x)-a1 (x)b2i. 


2  b^l^sin  x3 


(4.32) 


gL(x) 


a2  (x) 


a2(x)-a1(x)b2i2 


2  b  gi  cos^+Xj)  +  — 


? 

b„C 

2  2 


a“(x)--a1(x)b2-t“ 


^"b1^1^2Sin  x3^2x2+x4)x4  +  b38‘t’lCOS  xi  +  b1g-i?cos(x1+x3)) 
,  (4.33) 


u1  (x) 


gj(x)  +  — 


a2(x)T2-b2i2T1 


(4.34) 


a2  (x)-a1 (x)b2i2 


f2(x) 


al  (x) 


a2 (x)-a  ^ (x)b2i2 


2  bihZ2x2sin  x3 


(4.35) 


g2(x> 


a3  (x) 

— ~ - x  b  gt,cos(x,+x.) 

a2(x)-ai(x)b2i2  1  J 


a2  (x) 


a2 (x)-a  (x)b2i2 


2  (b^£^42sin  x3(2x7+x^)x, 


-  b 3gi ^cos  x1  -  b1gi2cos(x1+x3)) 


>»'■  "*■  •*- -  .f.  /.  i1. . «•- c. <*. 


---  ^ 


(4.36) 


(4. 


a2 -x)Ti_ai (x)T9 


u.,(x)  =  g2(x)  +2  , 

a2(X)~a1 (x)b,^2 


System  (4.31)  can  be  put  in  this  form: 


x  =  X°(x)  +  X*(x)u.(x)  +  X2(x)u_,(x) 
1  <_ 


where 


X°(x) 


*:<2 

"O' 

’o' 

fL(x) 

X4 

,  X 1 (x)  • 

1 

,  X2(x)  = 

0 

0 

0 

f7  (x) 

0 

1 

(4.3 


(4. 


Uj(x),  u7(x)  are  the  controls,  and  P  =  0  is  the  equilibrium 
point  of  (4.38). 

We  first  check  that  indeed  the  Kronecker  indices  are  all 
less  or  equal  to  2. 


=  dim  R1  *  dim  {X^X2}  *  2 
R2  -  {Xl(P),  (adX°,X1)(P),  X2(P),  (adX°,X2)(P)} 


4 


2 

Thus,  dim  R  = 

2  1 

and  l ^  *  dim  R  -  dim  R  =  2 

l.  =  0  for  i  >  3 

l  — 

The  Kronecker  indices  are: 

K.  =>  {#  l.  ii  >  1}  =  2 

1  i  — 

K-  *  {//  2. .  I  l.  >  2}  =  2 

2  l 1  i  — 

K.  =  0  i  >  3 

l  — 

all  less  chan  or  equal  to  2. 

For  the  conditions  of  the  feedback  linearization  theorem, 
since  X^(P)  *  0,  the  first  condition  is  trivial.  We  showed  that 
all  Kronecker  indices  are  less  than  or  equal  to  two.  This  makes 
condition  3  always  satisfied.  For  condition  2,  let 

C  *  {X1,(adX°,X1),X2,(adX°,X2)} 

2 

then  dim  C(P)  ■  dim  R  =  4  .  Hence  condition  2  is  true.  Therefore 
system  (4.38)  is  feedback  equivalent  to  the  canonical  form  (4.40) 


By  inspection  of  the  vector  fields  X  ,  X  ;  the  real  functions 


h^  and  h,  are  chosen  to  be 


h^(x)  =  x^ 


h2(x)  =  x^ 


such  that. 


h^O)  *  h2(0)  -  0 

dh^O),  dh2(0)  linearly  independent 

L  .h.  =  0  i,j  =  1,2 
X1  J 

The  coordinate  transform  is  then, 

=  ht(x)  =  xL 

y2  -  (L  ^Xx)  -  <X°,(l,0,0,0)t>  =  x2 
X 

y3  ’  x3 

y4  -  (L  Qh2)(x)  =  <X°, (0,0,1, 0)C> =  x4 


which  is  the  identity. 


are : 


The  feedback  controls  u^(x),  u7(x)  as  given  by  (4.26) 


u1 (x)  -  -  f , (x) 

+  Kj  (w^-Xj)  -  K2x_, 

(4.42) 

u2(x)  -  -  f2(x) 

+  K3(w3-x3)  -  K4x^ 

(4.43) 

where  Kp  K.,,  K^,  are  feedback  gains  and  w.,  w^  are  the 
desired  joint  positions. 

The  actual  control  torques  to  be  applied  by  the  actuators 
are  obtained  by  solving  equations  (4.34)  and  (4.37)  for  T^  and 
T^  •  We  finally  have: 

Tj  =*  (x)^-  f  ^  (x)  +  K1(wl~x1)  -  K2x,  -  gjCx)) 

+  a2(x)(^-  f0(x)  +  K3(w3-x3)  -  K^x^  -  g,(x))  (4.44) 

T2  =  a  2  ( x )  ^-  fj(x)  +  Ki  (wl~xi )  *  K2x?  “  g^x)) 

+  b 2 ”^2 (~  +  K3 (w3-x3^  *  K4x4  "  g2(x))  (4.45) 

When  these  torques  T^  and  T.,  are  applied  to  the  joints, 
the  nonlinear  system  (4.31)  (robot  model)  has  the  same  dynamic 
behavior  as  the  uncoupled  second  order  linear  subsystems  (4.40). 

For  this  two  joint  manipulator,  a  computer  simulation  was 
performed  to  evaluate  the  performance  of  the  controller. 

We  simulated  the  hand  motion  from  one  path  point  to  the  next  point, 
or  through  the  inverse  kinematics,  from  one  state  of  joint  positions 
to  a  desired  state.  Initial  data  (9^  »  1  rd ,  92  *  -0.5  rd)  and  desired 
joint  positions  (6^  =  0.5  rd,  92  *  0 rd)  were  arbitrarily  chosen. 


2< 


I 


Figure  4 
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6.  Feedback  linearization.  Simul 
from  (1,-0. 5)  to  (0.5,0). 


System  (4.50)  can  be  expressed  as: 


x  ■  X^ix)  +  X^(x)Uj(x)  +  X^xlu^Cx)  X^(x)u^(x)  (4.57) 

where 


"2 

"o' 

0 

"0" 

f  J  (x) 

1 

0 

0 

0 

0 

0 

4 

1 

2 

3 

,  X  * 

,  X  * 

,  X  => 

f2(x) 

0 

1 

0 

X6 

0 

0 

0 

_  0  . 

.0. 

_0_ 

-1. 

Uj(x),  u^(x)  are  the  controls  and  P  *  0  is  the  equilibrium 

point  of  (4.50) . 

We  check  again  the  Kronecker  indices. 

=-  dim  R1  *  dim  { X 1  ( P )  ,  X2(P),  X3(P)}  »  3 


R2  -  {XL(P) ,(adX° .X1 )(P) ,X2(P) ,(adX° ,X2)(P) ,X3(P) ,(adX° ,X3)(P) } 


l 

'1 ' 

0 

0 

(adX°,X1)(P)  = 

0 

V  V  V 

(p)  - 

0 

(adX°,X3)(P)  »  (0, 0,0,0, 1 ,0)C 

2  .2.1 

Hence  dim  R  “6  and  *  dim  ^  ~  dim  R  *  3  ,  £.  *  0  i  ^  3 

The  Kronecker  indices  are  given  by: 

K1  ”  K2  "  K1  *  2  ’  Ki  "  0  1  -  4 

Thus  conditions  L  and* 3  of  Che  feedback  theorem  are  satisfied. 

2 

Also  C(P)  *  R  .  Thus,  dim  C(P)  *  6,  which  makes  condition  3  true 
System  (4.50)  is  then  feedback  equivalent  to  the  following 
linear  canonical  form  (4.58): 


where  u  p  u7  and  are  new  free  controls.  Again,  the  real 

valued  functions  h^,  h^,  h^  are  chosen  as: 

h  j (x)  =  Xj 

h^ ( x)  =  (4. 

h^(x)  3  x^ 

which  leads  to  an  identity  coordinate  change.  The  feedback 
controls  Up  u^  and  u^  given  by  (4.26)  are: 

Uj (x)  *  -  (x)  ♦  K^(Wj-Xj)  -  K^x7 


u^(x)  3  -  f^(x)  ♦  K^(w^-x^)  -  K^x^ 
u^(x)  * 

The  explicit  control  forces  F  ,  F  and  torque  T  are  obtained 

K  Z 

from  (4.53),  (4.55)  and  (4.56). 


K5(w5~X5)  ~  K6X6 


FR  “  bl^~  fi^x)  +  Kl^w["xl)  -  iC'5x->]  +  b3g  Sln  x3  Ci.^3) 
T  *  Cj(x)[-  f^(x)  +  K-j  C  W3  -sc  3 )  -  K^x^]  +  b^gx^cos 


“  2  b4g  cos  x; 

b3^K5(w5_X5)  "  KftxJ 


6  6  J 


(4.64) 

(4.65) 


These  control  forces  F  ,  F  and  control  torque  T, 

R  z 


transform  the  nonlinear  system  (4.50)  (robot  model)  into  the  linear 
canonical  system  (4.58). 

Computer  simulations  were  conducted  in  the  same  conditions 
as  for  the  'computed  torque'  technique.  We  first  simulated  the 
joint  motions  from  initial  positions  (r  3  1  m,  $  3  0  rd,  z  3  -1  m) 


to  desired  positions  (r  3  0.5  m,  $  *  -0.6  rd,  z  *  -0.2  m) .  K  3 


K.,  *  K,  3  400  and  K.  3  K.  *  K,  *  40  ,  were  chosen  for  a 
3  5  2  4  6 

critically  damped  response.  The  simulation  results  are  presented 
in  table  4.7  and  figures  4.8  and  4.9.  It  is  noted  that  there  are 
no  steady  state  errors  and  no  overshoot.  The  largest  control 


effort  required  is  3  4480  Newtons.  The  convergence  time  (time 


at  which  the  steady  state  is  reached)  is  0.7  seconds.  This  con¬ 
vergence  time  can  be  further  decreased  by  choosing  larger  feedback 
gains,  as  Cable  4.8  shows.  However,  this  will  result  in  much 
larger  control  torques  and  forces.  For  Che  'computed  torque' 
technique,  as  was  discussed  previously,  with  a  feedback  gain  K  3 


5000  we  observed  a  position  error  of  14  tmn  (for  r  and  z)  and  a 


maximum  control  force  F^  3  56993  N.  So,  clearly  there  is  an 


appreciable  improvement  in  performance  with  the  feedback 


linearization  approach. 


Table  4.7. 


Simulation  of  joint  motions 
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Figure  4.9.  Feedback  linearization.  Controls 


In  the  computed  torque  technique,  we  have  demonstrated  that 
it  was  possible  to  lower  the  control  effort  requirements  by 
utilizing  time  varying  feedback  gains.  The  same  idea  can  be 
applied  to  the  feedback  linearization.  We  first  specify  the 
maximum  permissible  control  torques  and  forces,  and  start  the  joint 
motions  with  small  feedback  gains.  Then  as  the  error  positions  are 
diminishing,  these  gains  are  increased  in  a  piecewise  manner  [34]. 

In  this  study,  we  applied  this  technique  to  the  three  joint 
manipulator.  The  maximum  torque  (force)  is  chosen  to  be  400  N.m 
(N)  and  the  feedback  gains  are  doubled  as  long  as  the  torque  limits 
are  not  exceeded.  Similarly,  if  the  limits  are  about  to  be 
exceeded,  the  feedback  gains  are  reduced  by  half.  Again,  we 
simulated  the  joint  motions  from  (r  =  1  m,  <p  »  0  rd ,  z  =  -1  m)  to 

(r  =  0.5  m,  $  *  -0.6  ra,  z  =  -0.2  m) .  The  simulation  results  are 

given  in  table  4.9a  and  figures  4.10-4.11.  Note  a  shorter 
convergence  time  of  0.5  seconds,  with  no  overshoot  and  no  steady 

state  errors.  This  was  achieved  with  only  102  (400  vs.  4480)  of 

the  control  effort  required  by  the  constant  feedback  gain  design. 

One  may  try  to  optimize  this  technique  by  using  the  maximum 
torques  and  forces  available,  which  would  result  in  an  even  faster 
response.  Such  an  optimization  can  be  accomplished  by  feedback 
gains  which  vary  by  a  small  increment.  This  increment  would  depend 
on  the  control  limits;  and  the  smaller  this  step  size,  the  faster 
the  response.  However,  as  the  simulations  showed,  if  the  increment 
is  too  small,  the  system  can  acquire  a  large  inertia.  Since  the 
controls  are  bounded,  this  could  result  in  overshoot. 


Table  4.9a.  Simulation  of  joint  motions  from  (1,0,— 1) 
to  (0.5, -0.6, -0.2) . 
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Table  4.9b.  Feedback  linearization  with  optimal  time 
varying  feedback  gains.  Simulation  of 
joint  motions. 
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Figure  4. 10.  Feedback  linearization  with  time  varying  feedback 
gains.  Simulation  of  joint  motions  from  (l, 0,-1) 
to  (0.5, -0.6, -0.2). 
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Using  Chis  optimaL  approach,  we  simulated  the  same  joint 
motions  with  a  torque  (force)  limit  of  400  N.ra  (N).  For  these 
control  bounds,  a  step  size  of  0.5  was  used.  To  avoid  the 
overshoot  problem,  we  switched  from  a  criticaLLy  damped  response  to 
an  overdaraped  response  in  the  vicinity  of  the  final  desired 
positions.  The  simulation  results  given  in  table  4.9b  and  figures 
4.13-4.14  show  a  decrease  in  convergence  time  of  approximately  0.1 
second  (0.4  vs.  0.5)  for  this  optimal  approach  over  the  case  when 
the  feedbacks  gains  were  doubled. 

This  optimal  approach  needs  further  investigation  in  order 
to  determine  the  exact  relationship  between  the  feedback  gain 
increment  and  the  control  bounds,  with  the  overshoot  as  a 
constraint . 

A  control  system  is  said  to  be  robust  if  it  can  accommodate 
disturbances,  parameter  variations,  and  model  inaccuracies.  Ve 
tested  the  robustness  of  our  nonlinear  feedback  controller  by 
introducing  an  error  in  the  actual  computed  control  torques.  As 
table  4.10  il lustrates,  up  to  602  error  still  gives  a  good 
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trajectory  tracking.  So,  the  controller  is  quite  robust. 

Finally,  we  simulated  the  preplanned  path  motion  of  table 
4.4  for  the  nonlinear  feedback  controller  and  the  'computed  torque' 
controller.  The  time  varying  feedback  gain  technique  was  used  in 
both  cases.  For  the  nonlinear  controller,  we  specified  a  maximum 


*1 


Table  4.10.  Effect  of  disturbances  and  parameter  variations  on  the 
tracking  accuracy. 


Final  position  error 
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Table  4.11.  Path  tracking  simulation. 
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Comparative  results  are  presented  in  table  -*.11.  We  note 
that  the  feedback  linearization  design  yields  a  sma ller  tracking 
error  and  no  final  position  error.  However,  the  'computed  torque' 
design  results  in  larger  tracking  error  and  a  steady  state  final 
position  error.  Again,  it  appears  that  the  feedback  linearization 
approach  has  a  better  overall  performance.  This  approach  combined 
with  the  time  varying  design  results  in  fast  and  accurate  tracking 
with  relatively  low  energy  control  requirements. 


CHAPTER  V 


CONCLUSIONS 

The  robot  control  problem  in  general  consists  of  planning 
trajectories  which  describe  desired  hand  motions,  and  then 
controlling  the  robot  to  ensure  that  those  trajectories  are 
correctly  executed.  In  order  to  move  the  hand  along  a  trajectory, 
control  torques  must  be  exerted  by  actuators  at  the  joints.  Thus, 
it  is  necessary  to  convert  the  desired  hand  trajectories  into  a 
time  sequence  of  desired  joint  coordinates  (inverse  kinematics). 

The  dynamics  of  a  n-joint  manipulator  is  very  complex. 

The  resulting  dynamic  mathematical  model  is  a  system  of  n  highly 
nonlinear  and  coupled  second  order  differential  equations.  For  a 
six-joint  manipulator,  this  system  can  contain  hundreds  of 
algebraic  terras.  The  dynamic  control  of  such  a  system  is  then  the 
problem  of  controlling  a  multi-input  nonlinear  system.  For  robots 
many  control  strategies  have  been  developed,  among  which  the 
commonly  used  open  loop  control  is  the  simplest. 

In  this  thesis,  we  presented  and  analyzed  a  widely  used 
independent  joint  control  method,  the  'computed  torque'  technique. 
This  approach  is  basically  a  PD  action  law  with  some  nonlinear 
compensations.  The  simulation  study  confirmed  the  fact  that  the 
independent  joint  control  requires  high-power  actuators  for 


adequate  path  tracking.  We  demonstrated  that  it  is  possible  to 
lower  this  requirement  by  using  time  varying  feedback  gains. 

In  this  thesis,  we  also  have  studied  a  nonlinear  control 
approach  based  on  feedback  linearization.  The  feedback 
linearization  is  a  global  linearization  of  a  nonlinear  system  via  a 
coordinate  change  and  feedbacks.  A  recent  development  in  this 
theory  is  a  theorem  which  gives  necessary  and  sufficient  conditions 
to  linearize  a  nonlinear  system  [25]. 

To  design  the  nonlinear  controller,  we  first  began  with  the 
derivation  of  the  state  space  representation  of  a  n-joint 
manipulator  dynamic  equations.  This  state  representation  led  to 
Kronecker  indices  all  less  than  or  equal  to  2.  We  then  showed  that 
the  three  necessary  and  sufficient  conditions  of  the  feedback 
linearization  theorem  are  all  satisfied.  The  fact  that  all 
Kronecker  indices  are  all  less  than  or  equal  to  2  resulted  in  a 
linear  canonical  form  composed  of  n  linear,  time  invariant,  second 
order,  uncoupled  subsystems.  The  control  problem  was  then  reduced 
from  controlling  a  multi-input  nonlinear  system  to  controlling  n 
uncoupled  linear  second  order  subsystems.  Each  subsystem  was  then 
stabilized  by  state  feedback.  Finally,  we  constructed  a  general 
feedback  control  algorithm,  which  can  be  implemented  on  a 
computer . 

We  illustrated  this  approach  by  two  design  examples. 
Computer  simulations  were  also  conducted  to  analyze  and  evaluate 
the  performance  of  the  nonlinear  controller.  The  simulation 
results  show  satisfactory  performances.  We  obtained  a  fast 


response  with  no  steadv  state  error  and  no  overshoot.  The 
performance  was  farther  improved  by  utilizing  the  time  varying 
feedback  gain  technique.  The  maximum  required  control  torques  and 
forces  were  reduced  and  Che  response  was  faster. 

In  practice,  it  is  desirable  that  a  control  system  can 
reject  disturbances,  parameter  variations  and  model  inaccuracies. 

We  tested  the  robustness  of  the  controller  by  introducing  an  error 
in  the  actual  computed  control  torques  and  forces.  It  was  found 
that  the  controller  is  indeed  robust.  This  may  suggest  that  if 
on-line  computation  of  the  controls  is  a  complex  problem,  the 
dynamic  equations  Cased  in  control  computations)  can  be  simplified 
without  loss  in  dynamic  performance. 

Also,  a  path  tracking  motion  in  joint  space  was  simulated 
for  both  the  feedback  Linearization  design  and  the  'computed 
torque'  technique  design.  The  simulation  results  show  a 
substantially  better  performance  for  the  feedback  linearization 
approach,  which  yields  smaller  tracking  error,  no  final  position 
error  and  lower  control  effort  requirements. 

Of  course,  it  would  be  interesting  to  see  the  results  of  a 
practical  implementation  of  this  nonlinear  control  scheme. 

However,  this  implementation  is  not  done  in  this  thesis  due  to  lack 
of  robot  hardware.  It  will  be  done  when  the  robot  hardware  s 
available  to  us  in  the  future. 

We  also  mention  that  in  spite  of  the  complexit.  • 
dynamic  equations  of  a  n-joint  robot  manipulator  • v 
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linearization  theorem  was  readily  applicable.  This  is  mainly  due 
to  the  fact  that  all  Kronecker  indices  are  all  less  than  or  equal 
to  two.  Because,  in  general,  the  conditions  for  linearization  are 
restrictive.  First  of  all  the  involutive  condition.  In  terras  of 
nonlinear  system  theory  in  general,  one  may  try  to  extend  the 
applicability  of  the  feedback  linearization  theorem  by  seeking 
other  canonical  forms,  in  addition  to  the  linear  canonical  form 
(4-2-3).  For  example,  a  nilpotent  Lie  algebra  with  a  special 
structure.  This  can  be  a  future  research  problem. 


tun 


"  A.v 


REFERENCES 


[1]  D.E.  Whitney,  'Resolved  motion  rate  control  of  manipulators 
and  human  prostheses.'  IEEE  Transactions  MMS-10,  1969. 

[2]  M.E.  Kahn  and  B.  Roth,  'The  near-miniraum-t irae  control  of  open 
loop  articulated  kinematics  chains.'  Trans.  ASME,  Journal  of 
Systems,  Measurement  and  Control  93,  1971. 

[3]  J.F.  Engleberger,  Robotics  in  practice:  Management  and 
applications  of  industrial  robots.  Amacon,  1980. 

[4j  J.Y.S.  Luh,  W.D.  Fisher,  R.P.  Paul,  'Joint  torque  control  by 
a  direct  feedback  for  industrial  robots.'  Proc.  20th  IEEE 
Conf.  on  Decision  and  Control,  San  Diego,  CA,  December  1981. 

[5]  D.F.  Golla,  S.c.  Garg,  and  P.C.  Hughes,  'Linear  state- 
feedback  control  of  manipulators.'  Mech.  Machine  Theory  16, 
1981 . 

[6]  M.H.  Raibert,  J.J.  Craig,  'Hybrid  posit  ion/ force  control  of 
manipulators.'  J.  Dynamic  Systems,  Measurement  and  Control 
102,  June  1981. 

[7]  S.  Dubowsky,  D.T.  DesForges,  'The  applications  of 
model-referenced  adaptive  control  to  robotic  manipulators. ' 

J.  Dynamic  Systems,  Measurement  and  Control  101,  1979. 

[8]  D.E.  Whitney  and  E.F.  Junkel,  'Applying  stochastic  control 
theory  to  robot  sensing,  teaching,  and  long  term  control.' 
Proc.  Amer.  Control  Conf.,  Alexandria,  VA,  June  1982. 

[9]  M.  Takegaki,  S.  Arimoto,  'A  new  feedback  method  for  dynamic 
control  of  manipulators.’  J.  Dynamic  Systems,  Measurement 
and  Control  102,  1981. 

[10]  C.S.G.  Lee,  M.J.  Chung,  ’An  adaptive  control  strategy  for 
mechanical  manipulators.'  IEEE  Trans,  on  Autom.  Control 
AC-29,  September  1984. 

[11]  T.  Eun,  Y.J.  Cho,  H.S.  Cho,  'Stability  and  positioning 
accuracy  of  a  pneumatic  on-off  servo  mechanism.'  Proc. 

Amer.  Control  Conf.,  Alexandria,  VA,  June  1982. 


[12]  T.L.  Johnson,  'On  feedback  laws  for  robotic  systems.'  Proc . 
8th  IFAC  World  Congress,  Japan,  August  1981. 

[13]  K.K.D.  Young,  'Controller  design  for  a  manipulator  using 
theory  of  variable  structure  systems.’  IEEE  Trans.  Systems, 
Man,  Cybernetics  SMC-8,1978. 

[14]  A.  Isidori,  A.J.  Krener,  C.  Gori-Giorgi,  and  S.  Monaco, 
'Nonlinear  decoupling  via  feedback:  A  differential  geometric 
approach.'  IEEE  Trans.  Autom.  Control  AC-26,  1981. 

[15]  W.  Respondek,  'On  decomposition  of  nonlinear  control 
systems.'  Systems  and  Control  Letters  1,  1982. 

[16]  A.J.  Krener,  'On  the  equivalence  of  control  systems  and 
linearization  of  nonlinear  systems.'  SIAM  J.  Control  and 
Opt.  11,  1973. 

[17]  J.L.  Sedwick,  D.L.  Elliott,  'Linearization  of  analytic  vector 
fields  in  the  transitive  case.'  J.  of  Diff.  Eqs.  25,  1977. 

[18]  R.  Su,  'On  the  linear  equivalence  of  nonlinear  systems.' 
Systems  and  Control  Letters  2,  1981. 

[19]  H.  Nijmeijer,  A.  van  der  Schaft,  'Controlled  invariance  for 
nonlinear  systems.'  IEEE  Trans.  AC-27,  1982. 

[20]  A.  Isidori,  'Lecture  notes  on  differential  geometric  control 
theory.'  (to  appear). 

[21]  H.  Hemarai,  P.C.  Camana ,  'Nonlinear  feedback  in  simple 
locomotion  system.'  IEEE  Trans,  on  Autom.  Control  AC-19, 
1976. 

[22]  E.  Freund,  'Fast  nonlinear  control  with  arbitrary  pole- 
placement  for  industrial  robots  and  manipulators.'  The 
International  Journal  of  Robotics  Research  1,  1982. 

[23]  R.W.  Brockett,  'Feedback  invariants  for  nonlinear  systems.' 
IFAC  Congress,  Helsinki,  1978. 

[24]  B.  Jakubczyk,  W.  Respondek,  'On  linearization  of  control 
systems.'  Bull.  Acad.  Pol.  Sci.,  Ser.  Sci.  Math.  Astronom. 
Phys.  28,  1980. 

[25]  L.R.  Hunt,  R.  Su,  G.  Meyer,  'Design  for  Multi-input  nonlinear 
systems.'  Differential  Geometric  Control  Theory,  Proc.  of 
conf.  held  at  MTU,  June-July  1982. 

[26]  R.P.  Paul,  Robot  manipulators:  Mathematics,  Programming,  and 
Control.  MIT  Press,  1981. 


97 


[27]  W.E.  Snyder,  Industrial  robots:  Computer  interfacing  and 
control.  Prentice  Hail,  1985  (to  appear). 

[28]  R.P.  Paul,  'Manipulator  cartesian  path  control.'  IEEE 
Trans,  on  Systems,  Man  and  Cybernetics  SMC-9,  November  1979. 

[29]  L.E.  Malvern,  Introduction  to  the  mechanics  continuous 
medium.  Prentice  Hall,  1969. 

[30]  B.R.  Markiewicz,  Analysis  of  the  computed  torque  drive  and 
comparison  with  conventional  position  servo  for  a  computer- 
controlled  manipulator.  JPL,  Tech.  Memo,  March  1973. 

[31]  A.K.  Bejczy,  Robot  arm  dynamics  and  control.  JPL,  Tech. 

Memo,  February  1974. 

[32]  R.P.  Paul,  'Modeling  trajectory  calculation  and  servoing  of  a 
computer  controlled  arm.'  Stanford  ARtif.  Intel.  Lab.,  A. I. 
Memo  177,  September  1972. 

[33]  M.H.  Raibert,  B.K.P.  Horn,  'Manipulator  control  using  the 
configuraion  space  method.’  Industrial  Robot  5,  1978. 

[34]  H.  Song,  'The  design  of  linear  time-varying  system  to  achieve 
fast  response  with  no  overshoot.'  Master's  thesis. 

University  of  Colorado,  Boulder,  1984. 

[35]  M.  Vukobratovic,  'Dynamics  of  active  articulated  mechanisms 
and  synthesis  of  artificial  motion.'  Mechanism  and  Machine 
Theory  13,  1978. 

[36]  Z.X.  Wan,  Lie  Algebras.  Pergaraon  Press,  1975. 

[37]  H.J.  Sussmann,  'Lie  brackets  and  local  controllability:  A 
sufficient  condition  for  scalar-input  systems.'  SIAM  J. 
Control  (to  appear). 

[38]  H.  Hermes,  Lecture  notes  Math  542.  University  of  Colorado, 
Boulder. 

[39]  H.  Hermes,  'Local  controllability  and  sufficient  conditions 
in  singular  problems.'  SIAM  J.  Control  and  Opt.  14,  1976. 

[40]  M.  Brady,  J.M.  Hollerbach,  T.L.  Johnson,  T.L.  Perez,  M.T. 
Mason,  Robot  motion:  planning  and  control.  MIT  Press,  1982. 

[41]  M.Y.  Wu,  'Solvability  and  representation  of  linear  time- 
varying  systems.'  Int.  J.  Control,  v.  31,  no.  5,  1980. 

[42]  T.  Kailath,  Linear  systems.  Prentice-Hall,  1980. 


B 


& 

-  * 

l\V 

r>. 

as 

F 


•\«V 

■V. 

*  "  «, 


APPENDIX  A 


PROGRAM  0RB(  I NPUT , OUTPUT ,  TAPE20  > 


C 

C 

C 

C 

C 


xxxxxxxxxrxxxxxxxx':xxxxx:**xxxxxxxxxx*xxxxxxxxxxxxxx*xxxx*xx 


SIMULATION  or  A  THREE  JOINT  ROBOT  MANIPULATOR- 
FEEDBACK  L i NEAR  I ZAT  1  ON  t  HUNT- SU -MEYER ) • 

TIME  VARYING  FEE08aCK  GAINS. 
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INTEGER  N.METH, MITER,  INDEX,  I WK ( 6 ) .  IER.K 
REAL  Y ( 6 ) , WK ( 1 50 ) , X , TOL , XEND , H , MR , ML , L 

COMMOM.-a/BI  ,B2.B3,S4,  AKR,  AMI  ,  AK2 
COMMON/B/Z1  ,  Z2 

0 1  MENS  ION  AZ  <  1 20 ,  1 0 ) , UU ( 1 20 ,  1 0 ) , T (1 20  > , 2  C 1 20 ,  1  0  > 

EXTERNAL  FCN.FCNJ 

N=6 

M=3 

X  =  0. 0 

y<  n*i 

Y  (  2  )  *0 
Y ( 3) *0 . 

Y(4)=0. 

Y(5>**1. 

Y  (  6  )  =0 
TOL-  00001 
H=0  C000001 
METHsl 

Ml TER*0 
I NOEX* 1 
MR* 1 0 . 

ML*  4  . 

L*  1  . 

21*15. 

22*2. 02«SQRT(  Z1  1 
B1 *ML*MR/4. 

B2*ML*<  3 .  'A .  ) *MR 
B3*ML*MR 
B4*MR*L 
00  1  I ■  1  ,  N 

AZf 1  ,  I  ) *Y( I  ) 

1  CONTINUE 
T ( 1  )  *0 
2(1,1 >  *Z1 
2 ( 1 , 2) *Z2 
DO  30  K*1 ,60 

XEN0*0  01 *FLOAT(K) 

CALL  OGEARfN, FCN.FCNJ , X, H, Y , XENO. TOL.METH, 

/  MITER, INDEX, IWK.WX, IER)  * 

I F ( IER.GT. 1281G0  TO  500 
DO  50  I  *  1 , N 

A2(K*1  ,  I  1  * Y ( I  ) 

50  CONTINUE 

UU(K ,  1  )  -  AKR 
UU ( K . 2 ) * AM 1 


UUIK, 3) =AKZ 
TIK+1 ) =  XZND 
ZIX*1 , 1 ) =Z1 
2  ( K*  1 , 2) =22 
30  CONTINUE 

WRI TEI 20 , 60) 

60  "ORMATI /2X, ' THREE  .IOI’"T  ROBO'r  NAN  I  PULA  TOR  ■  FEEDBACK  ‘  , 

/  IX,  LINEAR! IAT! ON' ) 

WR! TEI20, 62) 

62  FORMAT! 2X, 1 Tl HE  VARYING  FEEDBACK  GAINS,//) 

WRI TE(20, 100) 

1 00  FORMAT! //  ' , 3X ,  ' T I  ME '  ,  1 2X ,  ' R '  ,  I 3X ,  ’ X2 '  ,  1 3X ,  ' PH l  '  , 

/  1  3X  .  ’ X4 '  ,  1 4X  ' 2 '  .  1 3X ,  X6 ’ / ) 

WRI TE( 20,  135 M  T(K ) ,  ( AZ(K .  I  )  ,  1  =  1  , N) , K= 1  ,45,5) 

105  FORMAT ;7( 3X, F! 2. 5) ) 

V;P  I  TE  (  20 ,  1  20) 

120  FORMAT < / - 8X ,  TIME'  ,  I  1 X ,  ’ FR ' ,  1 4X ,  ' T '  ,  1 3X ,  ' F2  '  ,  1 3X ,  ' K 1  '  ,  1 3X ,  'K2 
WRI TEt  20,  1 25> I TIK* 1  ) ,  ( UU(K,  I  ) ,  1  =  1  , M) ,  ! ZIK*1  ,  I  ) ,  1=1 ,2)  , K= 1 , 60 ) 
125  FORMAT ( 6 ( 3X , F 1 2 . 5  > ) 

GO  TO  900 
500  CONTINUE 

WRITE (20, 550) 

550  FORMAT! //4X,  XI ' , 15X, ' X2 ' , 15X, ’X3' , 15X,  X4  '  , 

/  1 5X, ' XS ' , 15X . ' X6 ' / > 

WR I TE ! 20 , 600  >!Y(t),i  =  1.N) 

600  FCRMAT! 6' 2X, F I  5.  ’ 0)  ) 

WR I TE <  20 , * ) < XENO , H , X , METH , M I  TER ) 

900  STOP 
END 


SUBROUTINE  FCNJ ! N , X , Y , PO ) 
INTEGER  N 

REAL  YIN) , PO!N, N) , X 

RETURN 

ENO 


SUBROUTINE  FCN! N, X, Y, YPRIME) 

INTECER  N 

REAL  YIN) .YPRIME IN) ,X,L 
COMMON/ A /B 1 , 82 , S3, B4  AKR, AMI , AKZ 
COMMON/B/21 , 22 
L  =1  . 

W1  =0.5 
W2= -0  6 
W3=  *0 . 2 

ER=(W1 -Y( 1 ) )=«2*<W2-Y<3) >**2*IW3-Y(5) ) «»2 

I F ( ER-O. 2)9, 9, 8 

CC= 1 . 01 

GO  TO  13 

CC=  1.15 

Cl =82  =  V I  1  )=«2*(B4/1.  )*(L-3,  «Yt 1  )  ) 

FI  =  <B2»YI 1  ) - ( 3.  '8.  i *84 l « I Yt  4 ) «»2/B1  ) 

F2= I -2 . *B2«Y I  1  ) ♦<  3 . /4 ,  ) «B4 ) « I Y 1 2 ) » Y I  4 ) /Cl  ) 
C2= (B3/B1 1 =9 . 8 1 *S I NI Y I  3 ) ) 

C3= ( -3F*Y<  1  ) *84/2.  i*(9.81 *COSI YI 3) ) /Cl  ) 

El =21 *WM  -21 »Y I  1  ) -Z2«Y! 2 ) 

E2  =  21 -W2-Z1 »Y I  3 ) - 22*Y I  4 ) 

£3*21 «W3-Z1 * Y I  5  >  *22= Y  <  6 ) 


101 


AKR=B1  •  ( C2-F1 +  E1  ) 

AM)  --Ci  «  (  -C3-F2*E2> 
AKZ=B3*E3 

IFtAes;AM! ) -400  '20, 20, 10 
IFf  4BS(AKR) -400.  >30.30,10 
I F ( ABS ( AKZ ' -400.  >40, dO, 10 
I  Ff  Z1 -10000. >60,60  50 
Z1 «Z1 ■ 1 . 5 
Z2*2. *CC»SORT< Z1 > 

SO  TO  5 

ZW1/1 .5 

Z2-2 . «CC  *  SORT ( Z1  > 

El «Z1 *<W1 -Y ( 1 > > -Z2*Y(2> 

E2  =  Z1 «  <W2-Y( 3>) -Z2*Y(4) 

E3  =  Z1  «  C.V3-  Y(  5)  >  -Z2«  Y  (  6) 
MKR=B1 * ( C2 -F 1 ♦  £ 1 > 

AMI =C1 * ( -C3-F2*E2) 

AKZ=83  <E3 

IF> ABS< AKZ' -400  >1,1  10 
I F ( 4BS(«M1 ' -400  >2.2, 10 
I F ( ABS ( AKR  > -400.  >50.50,  10 
YPR I  ME ( 1  >  =Y( 2) 

YPR I  ME ( 2  >  *F 1 -C2*AKR/B 1 
Y  PR  I  ME  <  3  )  =  Y(  4  ! 

YPR I  ME <  4 ) =F2 *C3 ♦ ( AMI /Cl  > 
YPR I  ME ( 5  > =Y ( 6) 

YPR I  ME 1 6 ) s AK Z/B3 

RETURN 

ENO 


SIMULATION  OF  A  THREE  JOINT  ROBOT  MANIPULATOR 
COMPUTED  TORQUE  TECHNIQUE. 

TIME  VARYING  FEEDBACK  GAINS. 


* 


I NTEGER  N , METH , M I  TER , I NOEX , l WK ( 6  > .  I ER , K 
REAL  Y ( 6 ) . WK ( 1  SO ) , X , TOL , XENO , H , MR . ML . L 

COMMON/ A/BI , B2.B3.B4, AKR.AM1 ,  AKZ 
COMMON/B/Z1 , Z2 

DIMENSION  AZ( 1 20, 10) , UU< 1 20 , 1 0 > . T t 1 20 ) , Z ( 1 20 , 1 0 ) 

EXTERNAL  FCN.FCNJ 

N*6 

M«3 

x*o.  a 

Y< 1 )»1 . 

Y ( 2 ) =0 . 

Y ( 3 ) =0 . 

Y ( 4 ) *0 . 

Y(S)=-1 . 

Y(6)aQ. 

TOL3  00001 
H*0. 0000001 
METH* 1 
MI TER*0 
I NOEX3 1 
MR= 1 0 . 

ML*4. 

L*1  . 

Z1 *13. 

Z232 . «SQRT(Z1 ) 

B 1 *ML*MR/4 . 

B2»ML* ( 3 . /4 . )«MR 
B3«ML*MR 
B4=MR3L 
00  1  I  3  1  , N 

AZ( 1 , I >  3 Y ( I  ) 

CONTINUE 
T ( 1 >30. 

Z(1 , 1 )«Z1 
Z(1 , 2 ) 3  Z2 
DO  30  K»1 ,60 

X£N030.01«FL0AT(K) 

CALL  OGEAR ( N , FCN , FCNJ , X , H , Y , XEND , TOL , METH , 

’  MITER, INDEX, I WK.WK, iER) 

IF( IER.GT. 128)00  TO  SOO 
DO  50  I  3 1 , N 

AZIK ♦ 1 ,  I  ) »Y  < I ) 

CONT I NUE 
UU(K , 1 ) 3 AKR 
UU<  K , 2 ) 3 AMI 


UU!K, 3) =AKZ 
T I K  ♦  1 ) =  XENC 
ZIK  +  1 , 1 ) *Z1 
ZIK*1 , 2) *Z2 
30  CONTINUE 

WRITE! 10, SO) 

SO  F0RMAT(/2X,  THREE  JOINT  ROBOT  MAM I  PULA TOR -COMPUTED ' . 

/  IX, 'TORQUE  TECHNIQUE') 

WRITE! 10, 62) 

62  F0RMAT(2X. 'TIME  VARYING  FEEDBACK  GAINS.'//) 

WRI T£< 10, 100) 

100  FORMAT! ///, 8X, 'TIME1 , 12X, ' R ' , 1 3X , ' X2 ' , 13X,  PHI 
/  1 3X,  'X4‘ ,  14X,  ' Z' ,  1 3X.  X6' /  ) 

WRI TE! 10,  105) !T!K) ,  !AZ(K,  I  ) .  1  si , N> , K* 1 , 60) 

105  FORMAT ( 7 ! 3X , F 1 2  6 ) ) 

WRITE! 10, 120) 

120  FORMAT! //8X,  ' T I  ME '  ,  1 1 X .  ' FR '  ,  1 4X ,  ' T ’  ,  1 3X ,  ' FZ '  ,  13X,  K1  '  ,  13X,  'K2'/> 
WRI TE! 10,  125) !TIK*1  ) ,  (UU(K,  I  ) ,  I  =  1  ,M) ,  ! ZUOl  ,  I  )  ,  I =1  , 2) ,K= 1 , 60) 

125  FORMAT ( 6 f  3X , FI  2 . 5 ) ) 

GO  TO  900 
500  CONTINUE 

WRITE! 10,550) 

550  FORMAT! //4X,  'XI  ' , 15X,  X2 '  ,  1 5X ,  'X3 ' ,  15X,  ' X4  '  , 

/  1 5X , ' X5 ' , 1 5X , ' X6 ' / ) 

WRI TE! 10, 600) (Y(Ii,I»1,N) 

600  FORMAT! 6! 3X, FI  5.  10) ) 

WRI TE! 10,  « ) I XEND , H , X , METH , M I  TER ) 

900  STOP 
END 


SUBROUTINE  FCNJ ( N, X , Y , PD ) 
INTEGER  N 

REAL  YIN)  ,  POIN,  N)',  X 

RETURN 

ENO 


SUBROUTINE  FCNIN, X, Y, YPRIME) 

INTEGER  N 

REAL  YIN) , YPRIME! N)  ,  X,  L 
COMMON /A/BI , B2 , B3, B4, AKR, AMI  ,  AKZ 
CCMM0N/B/Z1 , Z2 
L  si  . 

X71 si  . 

X81 *0. 5 
X91 s-0. 3 
X72s0.3 
X  32*0. 5 
X92*0 . 5 
X73»0. 2 
X83*0 . 2 
)!93s0. 2 

Cl =B2*Y  < 1  >  »«2*<B4/4  ) * ( L  *  3 ,  'Y! 1  ) ) 

F1s(B2  YI 1 ) - ( 3 . /8  ) «B4  >  * ! Y ! 4  >  *  *2/B I > 

F?.~  !  -2,  *82* YI 1  )  *1  3.  /4  .  )  «B4  )  *  I  Yf  2)  <Y(4i/C1  ) 

C2= ( B3/B 1 ) *9  81 «SIN(Y!3) ) 

C3*  !  -83*Y  I  1  )  *B-1/2  .  )  *  !  9  81  *  COS !  Y  (  3  >  )  /C 1  ) 

El *31 . ( X73*Z1 s (X71 -Y< 1 ) ) *Z2« I X72-Y ( 2) ) ) 

AKR*  El  ♦  (  -B2«Y  (  1  )  ♦  (  3  .  /8  )  «f.4  )«Y(4)!:*2*83r9. 81  »SIN(Y(3)  ) 


5 


E2*C1 * (X83+Z1 * ( X81 -Y( 3) ) ♦22* ( X82-Y ( 4 )  )  ) 

E3  =  £2  +  <  2. *B2*Y (1 )-<3./4. >*B4)*YC2)*Y(4) 

AMI *E3+<B3*Y( 1 ) -B4/2. ) *9. 81 *COS( Y( 3) ) 

AKZ*B3*<X93+Z1 *<X91 -Y( 5> ) +22* ( X92 - Y ( 6 ) ) ) 

! F { ABS ( AMI ) -5000.  >20,20,10 
IF(ABSCAKR) -5000.  >30, 30, 10 
IF(ABSCAKZ) -5000. >40,40,10 
I F ( 21 -50000. >60,60,50 
21*21*2.3 
22=2 . *SQRT ( 21 ) 

GO  TO  5 
21 *Z1 /2. 3 
22=2. *SGRT ( 21 ) 

El *B1 * (X73  +  21 * (X71 -Y< 1 ) ) +22*  t X72-Y ( 2 ) ) ) 

AKR*£1 ♦ ( -82* Y( 1 ) +( 3. /8.  > *B4 ) *Y ( 4 ) * *2+83=9  81 *SIN(Y(3>) 
E2*C1 *(X83  +  21 *(X81 -Y(3>  > +22* ( X02-Y ( 4 ) >  > 

E3*E2+ ( 2 . *B2*Y ( 1)-(3./4.  >*B4)*Y(2)*Y(4> 

AMI =E3+ ( B3*Y ( 1 > -B4/2. ) *9. 81 *COS(Y( 3) ) 

AK2=B3* ( X93+21 * (X91 -Y(5) ) +22* ( X92 - Y ( 6 > ) > 

YPRIMEf 1) =Y<2) 

YPR I ME( 2 ) =F1 -C2+AKR/B1 
YPR I ME( 3  >  * Y ( 4 ) 

YPR I ME( 4)*F2+C3+( AMI /Cl > 

YPR I ME( 5 ) =Y ( 6  > 

YPR I  ME ( 6 ) * AK2/S3 
RETURN 


<.  ■ 


In  robot  control,  variable  effective  inertia  and  gravity  loading  effects  suggest 
the  imperative  need  for  time- varying  models.  It  has  been  shown  [1]  that  proper 
use  of  time-varying  controller  can  produce  fast  robot  manipulator  motion  without 
causing  undesirable  overshoot.  Most  of  currently  available  methods  of  controller 
design  require  precise  parameter  values  of  the  plant,  which  are  often  impossible  or 
impractical  to  have  in  practice.  In  recent  years  a  new  control  design  philosophy 
has  emerged.  This  is  the  intelligent  control  [2].  An  intelligent  control  is  capable 
of  updating  its  control  strategies  through  learning.  It  is  operating  on  heuristic  as 
well  as  analytic  reasoning.  It  employs  both  quantitative  and  qualitative  information 
in  its  decision  making  processes.  In  fact,  very  often  the  qualitative  information  is 
placed  higher  than  the  quantitative  information  in  its  ruled-based  decision  hierarchy. 


Stability  is  the  first  requirement  in  any  satisfactory  control  action,  stability 
criteria  for  linear  time-invariant  systems  are  widely  available,  but  explicit  and  prac¬ 
tical  stability  criteria  for  time-varying  systems  are  still  lacking.  However,  explicit 
stability  criteria  for  certain  special  classes  [3]  of  linear  time-varying  systems  axe 
available.  An  intelligent  control  is  inherently  a  time-varying  control.  In  the  fol¬ 
lowing  we  shall  examine  the  stability  of  a  special  class  of  peridically  time-varying 
system.  The  effect  of  rate  of  parameter  variations  on  stability  will  be  examined  in 
detail.  We  believe  the  results  obtained  here  will  provide  the  qualitative  information 
that  may  provide  a  general  guideline  for  designing  an  intelligent  control  of  robot 
manipulation.  As  will  be  seen  in  subsequent  analysis  rate  of  parameter  variations 
appears  to  be  most  critical  to  stability  for  frequency  in  a  band  centering  around  the 
” resonant  frequency”  of  its  constant  nominal  system. 

Consider  a  linear  time- varying  system 

x(f)  =  Az(t)  +  Bu(t)  (1) 

the  stability  of  A(t)  in  general  is  still  an  open  problem.  However  stability  for 
special  A(t)  can  be  precisely  determined.  Results  obtained  from  such  analysis  may 
provide  useful  qualitative  information  in  the  design  of  intelligent  control  of  robot 
manipulator.  To  deal  with  a  manageable  problem  at  this  point,  we  shall  assume 
that  A(t)  can  be  separated  into 

A(t)  =  .40  +  Ai(t) 

where  .40  is  a  constant  nominal  part  and  Ai(t)  is  a  time- varying  part  that  represents 
varying  parameters  in  the  system.  It  is  further  assumed  that  associated  with  .4]  (0 
there  is  a  parameter  ui  that  governs  the  rate  of  parameter  variations  in  the  system. 
The  parameter  u>  cam  be  viewed  as  a  quantity  that  specifies  how  fast  the  time- 
varying  controller  is  changing  or  how  fast  the  variable  inertia  or  gravity  loading  of 
robot  manipulator  may  change  in  its  operation.  The  important  question  of  interest 


lL 


is  how  uj  will  affect  the  stable  operation  of  the  system,  i.e.  how  u  affects  the  stability 
of  the  system.  To  gain  insight  into  this  problem,  we  shall  examine  the  following 
special  system.  Consider  the  system  (1)  with  A(t)  being 

a  4-  7  cos  ut  +  6  sin  ut  0  +  6  cos  ut  —  7  sin  ut  (e>. 

'  '  ~  —  0  4.  6  cos  ut  —  7  sin  ut  a  —  7  cos  ut  —  6  sin  ut 


Equation  (2)  above  can  be  written  as 


A(t)  =  ,4q  + 


-4q  — 


—0  a 


Ai  = 


7  cos  ut  +  6  sin  ut  6  cos  ut  —  7  sin  ut 
6  cos  ut  — )  sin  ut  —7  cos  ut  —  6  sin  ut 


where  Ao  represents  the  constant  nominal  part  and  represents  the  time- 

varying  part  due  to  parameter  variations.  It  should  be  commented  that  Ai{t)  can 
also  be  written  as 


*<‘>-(1  4)C: 


cos  ut  —  sin  ut 
sin  ut  cos  ut 


.m  =  ( _c 


cos  ut  sin  ut 
-  sin  ut  cos  ut 


)  (l  4) 


=  L(t)C 


These  interpretations  permit  us  to  examine  the  time-varying  effects  of  control  or 
feedback.  It  can  be  shown  [4]  that  A(t)  in  (2)  is  stable  if  and  only  if 

a  <  0 


a2  +  02  -  72  -  62  +  - -  0u  >  0 

4 

It  will  be  interesting  to  see  the  physical  significance  of  those  conditions.  Note  that 
the  constant  nominal  system  Ao  is  stable  if  and  only  if  a  <  0.  Therefore  for  the 
time- varying  system  A(t)  to  be  stable,  it  is  necessary  that  the  constant  nominal 
system  be  stable.  However  such  a  condition  will  not  be  in  general  sufficient  to 
ensure  the  stability  under  parameter  variations.  It  is  intuitively  clear  that  the  rate 


of  variations  in  the  parameters  should  affect  the  overall  stability.  Here  we  like  to 
examine  the  effect  of  ui  on  stability  in  more  detail. 

The  effect  of  on  stability  can  be  examined  through  the  following  condition: 


f(oj)  =  -  (f)  +  a2  -  (72  +  62)  >  0 


It  is  easy  to  see  that  if  the  constant  nominal  system  .-io  has  sufficient  damping  to 
suppress  the  perturbation  induced  by  parameter  variations,  namely  if 

a2  >  7 2  +  6 2 

then  the  above  time- varying  system  is  stable  for  all  u ;.  In  other  words,  if  the 
magnitude  of  parameter  variations  is  not  large  enough  to  upset  the  stability  of  the 
constant  nominal  system,  then  the  rate  of  parameter  variation  has  no  bearing  on 
the  overall  stability. 

It  can  also  be  seen  that  if  the  magnitude  of  variations  is  sufficient  to  cause 
instability  i.e.  if 

7 2  +62  >  q2 

great  attention  should  be  paid  to  the  rate  of  variations.  It  is  found  that  stability  is 
determined  by  a  critical  frequency  band  [uq,u>2](to  be  called  the  instability  zone). 

where  _ 

uq  =  2  —  \Zl2  +  62  —  <*2J  (7a) 

and  _ 

uj2  =  2  ^13  +  \/72  +  62  —  a2 J  (76) 

The  system  is  unstable  if  u;  £  (aq,^]  and  it  is  stable  if  u  £  [uq, u,^].  Not  that 
the  instability  zone  is  centered  at  2/?,  twice  the  damped  frequency  of  the  constant 
nominal  system,  rather  than  centering  at  its  undamped  natural  frequency  given  by 
ujn  =  as  intuition  may  suggest. 


Some  useful  observations  concerning  stability  of  time-varying  system  are  sum¬ 
marized  below: 

1.  Magnitude  of  variations  is  found  to  be  more  significant  than  the  rate  of 
variations  in  affecting  the  system  stability. 

2.  For  a  time-varying  system  to  be  stable,  very  often  it  is  necessary  to  have  a 
stable  constant  nominal  system. 

3.  If  the  magnitude  of  variations  is  lore  enough  to  upset  the  stability,  the 
system  tends  to  have  the  worst  destabilizing  effect  when  the  rate  of  variation  is  in 
or  near  the  instability  frequency  band  centred  at  twice  the  damped  frequency  of  its 
constant  nominal  system.  In  other  words,  it  is  important  to  avoid  excting  possible 
resonance.  It  is  also  noted  that  very  fast  variation  (when  u>  is  large)  or  very  slow 
variation  (when  u>  is  small)  tend  to  offer  better  chance  of  avoiding  instability. 


of  variations  in  the  parameters  should  affect  the  overall  stability.  Here  we  like  to 
examine  the  effect  of  u>  on  stability  in  more  detail. 

The  effect  of  ui  on  stability  can  be  examined  through  the  following  condition: 

/M  =  (^  ~  #)  +  a2  ~  (~t2  +  62)  >  0  (C) 

It  is  easy  to  see  that  if  the  constant  nominal  system  .-lo  has  sufficient  damping  to 
suppress  the  perturbation  induced  by  parameter  variations,  namely  if 

a2  >  72  4-  S2 


then  the  above  time-varying  system  is  stable  for  all  u>.  In  other  words,  if  the 
magnitude  of  parameter  variations  is  not  large  enough  to  upset  the  stability  of  the 
constant  nominal  system,  then  the  rate  of  parameter  variation  has  no  bearing  on 
the  overall  stability. 

It  can  also  be  seen  that  if  the  magnitude  of  variations  is  sufficient  to  cause 
instability  i.e.  if 

7 2  -M2  >  a2 

great  attention  should  be  paid  to  the  rate  of  variations.  It  is  found  that  stability  is 
determined  by  a  critical  frequency  band  [uq,u;2)(to  be  called  the  instability  zone), 
where 


The  system  is  unstable  if  u>  €  and  it  is  stable  if  u>  £  [uq,^].  Not  that 

the  instability  zone  is  centered  at  20,  twice  the  damped  frequency  of  the  constant 
nominal  system,  rather  than  centering  at  its  undamped  natural  frequency  given  by 
u/n  =  y/a*  +p,  as  intuition  may  suggest. 


Some  useful  observations  concerning  stability  of  time- varying  system  are  sum¬ 
marized  below: 

1.  Magnitude  of  variations  is  found  to  be  more  significant  than  the  rate  of 
variations  in  affecting  the  system  stability. 

2.  For  a  time-varying  system  to  be  stable,  very  often  it  is  necessary  to  have  a 
stable  constant  nominal  system. 

3.  If  the  magnitude  of  variations  is  lore  enough  to  upset  the  stability,  the 
system  tends  to  have  the  worst  destabilizing  effect  when  the  rate  of  variation  is  in 
or  hear  the  instability  frequency  band  centred  at  twice  the  damped  frequency  of  its 
constant  nominal  system.  In  other  words,  it  is  important  to  avoid  excting  possible 
resonance.  It  is  also  noted  that  very  fast  variation  (when  u  is  large)  or  very  slow 
variation  (when  u i  is  small)  tend  to  offer  better  chance  of  avoiding  instability. 
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Abstract 

The  possibility  of  controlling  dynamical  systems  under  incomplete  and  even 
very  small  a  priori  information  is  based  on  the  application  of  adaptation  and  learn¬ 
ing  in  automatic  systems  which  reduces  initial  uncertainty  by  using  the  information 
obtained  during  the  process  of  control.  It  goes  without  saying  that  adaptive  tech¬ 
niques  in  control! but  also  in  filering  and  prediction)  have  been  extensively  studied 
for  over  a  decade,  and  not  long  ago  a  rigorous  and  comprehensive  theory  of  conver¬ 
gence  of  adaptive  algorithms  has  emerged  [2].  Also  in  practice  numerous  successful 
applications  have  been  reported  ([1],  [2]).  At  the  same  time  little  attention  has 
been  given  to  "learning”  and  “self  learnining”  in  the  world  of  control  engineering 
until  recently  when  S.  Arimoto  and  his  co-workers  [3]  proposed  a  learning  control 
method  called  betterment  process  based  on  a  simple  iterative  algorithm,  which  was 
successfully  applied  to  the  motion  control  of  robots[4].  A  closer  look  at  the  better¬ 
ment  process  may  lead  to  other  alternatives  and  perhaps  to  a  unified  (generalized) 
method.  The  developement  of  such  learning  control  methods  along  with  possible 
applications  and  problems  for  research  in  learning  control  form  the  basis  of  this 
thesis.  Before  presenting  the  thesis  outline  it  is  important  to  notice  that  the  terms 
“adaptation”  &  “learning”  do  not  have  a  unique  interpretation  and  usually  their 
definitions  vary  from  one  author  to  another  or  even  from  one  technical  paper  to  an¬ 
other  [1].  It  is  certainly  not  the  purpose  of  the  author  to  get  involve  in  the  technical 
terminology  but  since  “learning”  seems  to  be  easier  to  understand  in  the  context  of 
this  thesis,  it  will  be  therefore  used  exclusively  (unless  otherwise  stated). 

Outline  of  the  first  part 

This  part  of  the  thesis  consists  of  the  following  four  sections.  In  the  first 
section  the  betterment  theory  is  briefly  reviewed  :  mathematical  background,  main 
theorems,  and  different  schemes  are  introduced.  Some  basic  remarks  and  notes  end 
this  section.  Section  two  contains  the  theoretical  basis  of  learning  methods  based  on 
function  decomposition,  to  do  that  some  results  from  linear  algebra  and  functional 
analysis  are  needed.  In  the  third  section  a  more  powerful  and  general  algorithm  is 
developed.  Remarks  and  discussions  concerning  the  limitations  and  applications  of 
the  algorithm  conclude  this  section.  In  the  last  section  of  this  part  of  the  thesis 
the  extension  of  the  decomposition  techniques  to  a  class  of  nonlinear  systems  is 
investigated,  it  is  also  shown  that  a  combination  in  a  certain  way  of  the  betterment 
algorithm  and  the  decomposition  techniques  can  yield  to  a  faster  learning  . 

1.0  Betterment  Processes 

In  this  section  an  algorithm  called  betterment  process  for  linear  systems  is 
introduced.  This  algorithm  updates  the  control  input  based  on  the  previous  oper¬ 
ation  data  and  “betters”  the  performance  of  the  next  operation  in  a  certain  sense, 
provided  a  desired  output  is  given.  In  this  section  three  types  of  learning  control 
scheme  are  discussed. 


1.1  Background 

Consider  the  linear  time  invariant  continuous  system  described  by  the  following 
state  equations: 

X(t)  =  AX(t)  +  Bu(t)  (la) 

y(t)  =  CX(t)  (ib) 

where  t  represents  the  time  in  the  interval  [0,T].  If  we  assume  that  for  each  run  the 
initial  state  A'*(0)  is  the  same  fixed  state  x°  then  in  the  k-th  run  the  output  of  the 
system(l)  is 


Vk(t)  =  CeAtXk(0)+  [  CeMt~r)Buk(T)dr 

Jo 

=  CeAtx°  +  f  CeA(t~T)Buk(T)dT  (1.2a) 

Jo 

=  g(t)+  [  h(t  —  r )uk(r )dr  (1.2 b) 

Jo 

From  (1.2)  it  is  clear  that  g(t)  and  h(t)  are  the  same  for  each  run  in  the  interval 
[0,T],  That  is 

g(t)  =  CeAtx°  (1.3a) 

h{t)  =  CeAtB  (1.3  b) 

Definition  1.  (see  [3], [5])  A  linear  time  invariant  system  described  by  (1.2)  is  said 

to  be  strictly  positive ,  if  for  any  T  >  0  and  any  u(t),  t  is  in  [0,  T\  the  following 
inequality  is  satisfied  with  some  constant  a  >  0 


uT(t)  h(t  —  t )  u(r)  dr  dt  >  a 


uT(t )  u(t)  dt 


(1-4) 


Definition  2.  Given  a  vector  valued  function  u(t),  t  in  the  interval  [ 0,T ']  then  the 
L2-norm  of  u (t)  is  defined  by 

Hull  = 

Definition  3.  The  spectral  radius  y0  of  a  matrix  A  is  defined  as 

7o  =  p{*4}  =  max  |A| 

A€<r(.t) 

where  afA]  represents  the  set  of  the  eigenvalues  of  the  matrix  A. 

Definition  4.  A  rational  transfer  function  matrix  H(s )  is  said  to  be  proper  if 

lim  H(. s)  <  oc 


i: 


ur(f)  u(t)  dt 


1/2 


(1.5) 
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and  strictly  proper  if 

lim  £T(s)  =0  (zero  matrix) 

3  —  OO 

In  the  scalar  case,  a  transfer  function  is  proper  if  the  degree  of  the  numerator 
polynomial  is  less  than  or  equal  to  the  degree  of  the  denominator  polynomial. 


1.2  C°-Type  Betterment  Process 

Consider  the  linear  time  invariant  continuous  system  described  by  (1)  where 
the  input  vector  u(t),  and  the  output  vector  y(t)  have  the  same  dimension.  Also 
suppose  that  a  desired  output  vector  yd(t)  is  given  over  the  interval  [0,  T],  then  the 
C°  -type  betterment  process  is  defined  by 

Mfc+i(0  =  Uk(t)  +  refe(t)  (l.G) 

where 

e*(<)  =  yd(t)  ~  Vk(t) 

and  T  is  an  m  x  m  constant  gain  matrix  to  be  defined.  yi{t)  and  uk(t)  are 
respectively  the  m-dimensional  output  and  input  of  the  system  (1)  in  the  k-th  run. 

Theorem  1.  Suppose  that  the  linear  time  invariant  continuous  system  (1)  is 
strictly  positive  and 

r  =  yi  (i.s) 

I  is  the  m  x  m  identity  matrix  and  7  is  a  sufficiently  small  and  positive  constant. 
Then  the  C°  -type  betterment  process  is  convergent  in  the  sense  that 

ll«*+i!l<PlM  d-9) 


where 


0  <p  <  1 


The  proof  of  this  theorem  can  be  found  in  [3J. 


1.3  C1  And  Mixed  Type  Betterment 

The  Cl-type  betterment  is  described  by  the  following  simple  iterative  rule  of 
input  modification: 

uk+,(f)  =  ut(f)  +  r^{jw(f)  -  yk(t)}  (1.10) 

Here  also,  u*(t)  and  yt(<)  are  the  m-dimensionai  system  input  and  system  output 
respectively  (the  system  under  test  is  system  (1)).  The  constant  T  is  an  m  x  m 
constant  matrix  called  the  “gain  matrix 
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Theorem  2.  The  C1  -type  betterment  process  defined  by  (1.10)  converges  in  the 
sense  that  as  k  — »  oo  y*(i)  — *  yd(t)  uniformly  in  t  over  [0,X]  if  the  following 
conditions  are  satisfied: 

1: 

II Im  -  CBriloo  <  1 

where  ||.||oo  is  the  matrix  norm  induced  by  vector  norm  v  and 


IMloo  =  max  |w,-| 
l<»<m 


2: 


u0(t),yd(t)  are  in  C'[0,  T] 


3: 

Vd(0)  =  Cx° 


The  proof  of  theorem  2  can  be  found  in  [3]. 

Obviously,  since  the  parameters  of  the  system  (1)  are  assumed  to  be  unknown 
condition  1  does  not  help  in  choosing  the  gain  matrix  F.  In  comparison  with  the 
above  two  types  of  betterment,  it  is  also  useful — as  will  be  shown  in  the  next 
subsection — to  mention  the  so  called  mixed-type  betterment  process  defined  by: 


«*+i(0 


uk{t)  + 


(•+*) 


ek{t). 


(1.11) 


1.4  Remarks  And  Discussion 

In  the  above  subsections  different  function  norms  have  been  chosen  in  evalu¬ 
ating  the  performance  of  different  types  of  betterment  processes.  For  the  C'-type 
betterment  the  uniform  norm  ||.||a  becomes  inadequate  when  the  considered  time 
interval  [0,T]  expands,  because  the  constant  A  becomes  too  large.  Although  it  can 
be  shown  that  for  an  asymptotically  stable  linear  system  a  fixed  value  for  A  is  per¬ 
mitted  for  any  expansion  of  the  interval  [0,T],  it  may  be  more  suitable  to  choose 
some  other  kind  of  function  norms,  especially  when  the  desired  output  yd(t)  is  de¬ 
fined  over  the  semi-infinite  interval  [0,  oo).  For  example,  if  we  consider  the  L-2  norm 
for  the  Cl-type  betterment  process  and  the  desired  output  yd(t)  is  defined  over  the 
semi-infinite  interval  [0,  oo)  then  it  can  be  shown  that 


e*+i(0  =  (Im  -CDT)ek(t)  - 


CAe-4(,-r)Brefc(r)dr 


(1.12) 


provided  that  all  eigenvalues  of  matrix  A  have  negative  reed  parts.  Now  if  we  take 
the  Laplace  transform  of  the  above  equation  and  denote 


=  C{h+\{t)} 


4 


then  we  get 


By  putting 


Ek(s)  =  C{ek(t)} 

H(s)  =  C(sl  —  A)~XB 

Ek+1(s)  =  (Im~sH(s)T)Ek(s ) 

7  =  sup  p{Im  -  juH(juj) f} 


(1.13) 


(1.14) 


(1.10) 


where  p{A’}  is  the  spectral  radius  of  the  matrix  X  and  ||E||  is  defined  for  a  vector 
valued  function  E  =  (E\, ...,  Em)T  by 


/+co  m 

£  \Ei(jv)\2  dw 

■°°  i=i 


we  get 


||Efc+1||<  711^11 

which  implies — according  to  Parseval’s  equality — the  following  inequality 

l|e*+i||  <  7l|e*ll 


(1.17) 


(1.1S) 


Therefore  we  can  conclude  that  the  betterment  process  converges  in  the  L2- 
norm  sense  if  7  is  less  than  unity.  However  this  is  impossible  unless  the  denominator 
of  H(s)  vanishes  at  s=0  as  one  can  see  from  equation  (1.15).  This  same  remark  is 
also  relevent  for  the  C°  -  type  betterment  process.  In  this  case  and  considering  the 
frequency  domain  (as  was  done  above)  the  convergence  of  the  process  is  assured  if 


7o  =  supp{Jm  -  H(ju)$)  <  1 


(1.19) 


However  this  condition  is  not  always  satisfied  for  linear  causal  systems  with 
a  proper  transfer  function  matrix,  because  in  general  for  such  systems  H(s)  — *• 
0  as  s  — ►  00.  In  view  of  these  arguments  the  convergence  of  the  mixed  type 
betterment  process  can  be  assured  if  one  can  choose  the  appropriate  matrices  T 
and  $  such  that 

7  =  sup p{Im  -  ($  +  Tjuj)H{ju)}  <  1.  (1.20) 

which  leads  to  the  following  question  :  What  is  the  class  of  linear  dynamical  sys¬ 
tems  for  which  such  matrices  $  and  T  exist  ?  The  study  of  this  question  and  the 
question  of  using  other  types  of  betterment  process  with  different  function  norms 
are  interesting  subjects  of  research. 

Before  ending  this  section  it  is  necessary  to  mentione  that  for  the  above  al¬ 
gorithms  it  is  sufficient  (but  not  necessary)  to  reset  the  initial  state  in  each  run 
however  it  is  necessary  that  for  each  run  (1.3a)  holds.  In  this  case  condition  3  of 
theorem  2  changes  to 

y<j(0)  =  yt(0)  for  all  k 


SOT 


Example  1.1 

To  see  the  applicability  of  the  scheme  presented  in  this  section,  consider  the 
following  system: 

X  =  AY  +  Bu 

rY  (1-21) 
y  =  CX 

Where 


-1  -2 


C  =  [0  1 


and  the  desired  output  is 


yd(t)  —  12t2( 1  —  t)  (1.22) 

t  6  [0, 1].  The  CJ-type  betterment  has  been  chosen  for  this  example.  Since  CB  =  1 
the  assumption 


Hi-rcBHoo  <  i 


is  now  given  by: 


(1.23) 


|l^-r|<l  (1.24) 

In  this  particular  example  we  choose  the  constant  T  to  be 

r  =  1  (1.25) 

Figure  (1.1)  shows  that  in  a  few  iterations  ( k  =  4)  the  desired  output  i/d(f)  is 
achieved.  Finally,  it  should  be  noted  that  it  is  impossible  to  choose  y(t)  =  i\(t)  as 
output  of  the  system  (1.21)  because  in  this  case  CB  =  0  and  there  exist  no  T  for 
which  (123)  can  be  satisfied. 


2.0  Betterment  Process  Based  On  The  Decomposition  Of  Functions 

In  this  section  an  iterative  method  for  betterment  process  is  developed.  The 
desired  output  y<i(t)  is  a  continuous  function  defined  over  the  interval  [0,T]  where 
T  <  o o  is  a  given  constant.  The  desired  output  yj{t)  of  a  single-input  single¬ 
output  linear  time  invariant  continuous  system  is  expressed  as  a  linear  combination 
of  functions  from  a  complete  orthonormal  set  given  apriori.  The  iteration  method 
is  then  applied  on  such  decomposition. 


2.1  Theoretical  Background 

The  following  definitions  and  theorems  are  needed  in  the  developement  of  the 
scheme  presented  in  this  part. 


Definition  5.  A  set  of  functions  o,(t)  i  =  1,2 _ defined  on  an  inten-al  1 1 .  f 2 j 


is  said  to  be  orthogonal  if 


*  -  { 2;.  ;£j 


and  orthonormal  if  in  (2.1)  the  constant  k,  =  1 

Definition  6.  The  orthonormal  set  o2(t), . . .}  is  said  to  be  complete  if  and 

only  if  it  is  not  a  subset  of  a  larger  orthonormal  set. 

Let  the  approximation  of  f(t)  on  [f  i ,  <2]  for  a  given  set  of  n  orthogonal  functions 
he 

/(f)  as  cibi(t)  +  c2<j>2(f)  +  ■  ■  ■  +  cnon(t)  (2.2) 

the  mean  square  error  (MSE)  between  the  true  value  of  f(t)  and  the  approximation 
£"=1  Ci<f>i{t)  is  given  by 


mse  =  - — 

-  <1  Jt\  [  “1 


It  can  be  shown  that  the  best  approximation  of  f(t)  on  [fi,t2]  in  the  sense  of 
minimum  mean  square  error  is  when  c,,  i  —  1, 2, . . . ,  n  are  chosen  as  follows 


Cj  —  Ti/^»  ^  —  1, 2, . . . ,  n 


where 


ft2 

7i  =  /  /(*)  ^ 

Jt  1 

t,  =  r  tf(t)dt 

in 


( t )  dt 


Definition  7.  The  function  f(t)  =  f(e,N,  <£)  defined  in  the  interval  [fi,f2]  is  said 
to  be  an  e  -approximation  of  a  given  function  f(t)  on  the  interval  [f  t ,  t2],  with  respect 
to  a  chosen  set  of  orthogonal  functions  described  by  the  vector  valued  function: 

$  =  [<t>l(t),  <t>2(t),  fait),  ■  ■  ■  , 

if  for  a  given  positive  e  there  exists  a  number  N  such  that  for  all  t  in  [ti,  f2] 

11/  —  /||  <  e  for  n  >  N 

lemma  1.  Let  the  reed  valued  function  f(t)  be  square-integrable  on  the  inten-al 
(f  1 ,  f2]  then  the  e-approximation  of  f(t)  with  respect  to  <1*  is  given  by 

.v 

/(f)  = 


where  c,  are  defined  by  (2.4)  and  (2.5) 

There  are  many  sets  of  functions  that  can  be  used  to  represent  a  function  on  an 
interval  For  examples  the  sets  of  {cos(n*f)}. 

{sm(nu-’f)}.  the  set  of  Walsh  functions  and  the  set  of  Legendre  functions  defined  on 
[— 1,  1]  by 


>„(*)  =  l/x/2,  Ox(t)  =  ty/3/2,...d>n(t)  = 


!2n  +  1 


Pnit) 


where  the  Legendre  polynomials  p„(f)  are  generated  by  the  formula 

Pn(i)  =  2Prd.  dr  ~ 

In  this  part — as  previously — the  L2  norm  will  be  used.  For  that  reason  the 
following  two  theorems  (see  [6]  for  proof)  will  be  important  in  establishing  the  rela¬ 
tion  between  the  concept  of  multiplier  (linear,  continuous  time  invariant  operator) 
and  the  concept  of  transfer  functions  of  the  space  A'(Q)  which  contains  all  complex 
valued  functions  s  — +  H(s)  of  the  complex  variable  s,  and  it  is  bounded  and  holo- 
morphic  in  the  open  right  half  plane  {s  :  Re(s)  >  0}.  AT( 0)  is  normed  algebra  under 
the  pointwise  multiplication  of  functions  and  under  the  norm 

\\H\\=  sup  \H(s)\ 

«e(s)>0 

Theorem  3.  (L2-  representation  theorem)  Let  M(L2)  represent  the  algebra  of  all 
multipliers  in  L2.  Then  there  exists  an  isomorphism  of  rings  such  that  to  each  A  in 
M(L2)  there  is  assigned  a  transfer  function  H  in  A’(0)  satisfying 
(0 

C(Af)(s)  =  H(s)C(f)(s) 
for  all  f  in  L2  and  all  Re(s)  >  0 

(u) 

mu  =  \m 

For  completeness  the  Ln2-representation  theorem  is  also  given. 

Theorem  4.  (L n 2-representation  theorem)  The  ring  of  all  multipliers  in  Ln 2  is 
isomorphic  with  the  ring  of  all  nx  n  matrices  over  A'(0),  in  such  a  way  that  to  each 
A  of  \I{Ln 2)  there  is  assigned  A  in  A'(0)n*n,  called  the  matrix  transfer  function  of 
A  such  that 

(CAf)(s)  =  A(s)(Cf)(s) 

for  all  f  in  Ln 2  and  all  Re(s)  >  0 

The  proofs  of  the  above  two  theorems  are  found  in  (Gj. 


2.2  Developement  Of  The  Scheme 


S 


2.2.1  Developement  Of  The  Iteration  Method 

Let  the  set  {<?i(t),  02(f),  •••,  0n(f)}  represent  a  set  of  orthonormal  functions 
defined  on  [0.  T\  where  T  >  0  is  a  given  constant.  Consider  also  the  vector  valued 
function 

$  =  02(f),  03(f),  •  •  •  ,  0n(f)]T 

Let  be  the  e  -approximation  of  the  desired  output  of  a  single-input  single¬ 

output  time  invariant  continuous  system  and  defined  as 

n 

jjdjt)  =  y^a?0,-(t)  (2.C) 

i=i 

and  let  the  input  and  the  output  of  the  k-th  run  be  respectively  e  -approximated 

by 

n 

=  (2.7) 

1=1 
n 

fc(f)  =  (-•$) 

1=1 

From  (2.6)  and  (2.8)  we  get 

n 

e*(f)  =  yd{t)  -  yii(t)  =  ]P(af  -  of  )4>,{t)  (2.9) 

1=1 

In  this  case  we  can  use  the  following  vector  representation 

*0i(f)‘ 

02(f) 

e*W  =•!#.&* . A\  .  -2T(*>*  (2-10> 

where  /?(fc)  =  [y3f , > ••  • , /?n]T  Similarly  we  have 

y*(<)  =  or(fc)$,  o(fc)  =  [af,af, - af]T  (2.11) 

and 

y</(f)  =  d  =  [a?,a2i,...,o']r  (2.12) 

If  we  consider  the  C°-type  betterment  we  get  for  t  in  [0,  T\ 

n 

u*+1(f)  =  £df+l0,(O  (2.13) 

i=i 


where 


tf+1=tf +7  (af-a?) 


(2.14) 


In  vector  notation  equation  (2.14)  is  equivalent  to 


(3{k  +  l)  = /3(k)  +  ~f(d  — a(k))  (2.15) 

Now  consider  that  yk{t)  and  Uk(t)  are  related  by  a  linear  operator  L  as  follows  : 


ijk(t)  =  L(uk(t )) 


(2.1G) 


Therefore 


yk(t)  =  L('£3*ti(t)) 


(2.17) 


But  since  L  is  linear  then 


y*(<)  =  X>* 


In  the  basis  ,  <^„(#)},  £(<£,(£))  can  be  written  as 


or  in  vector  form 


L($)  =  P$ 


(2.20) 


where  P  =  [p,j]  is  a  n  x  n  constant  matrix.  Therefore,  in  vector  form  (2.18) 
can  be  written  as 


yk(t)  =  0T(k)P$ 
Using  (2.21)  and  (2.11)  we  have 


a(k)  =  PTf3(k) 


(2.22) 


Plugging  (2.22)  in  (2.14)  we  get 


P(k  +  l)  =  3(k)  +  ~,(d-PT3(k)) 


(2.23) 


£(fc  +  l)  =  (J-7P7’)dU-)+7ci 


(2.24) 


The  convergence  of  Equation  (2.24)  and  the  choice  of  7  axe  stated  in  the  fol¬ 
lowing  theorem. 
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Theorem  5. 


Let  ijd(t)  be  a  given  desired  trajectory  defined  over  a  time  interval  and 

let  e*(f)  be  as  defined  in  (1.7)  and  ek{t)  be  its  e  -approximation  with  respect  to  $. 
Also  let 

I($)  =  P$  (2.25) 

where  P  is  a  .V  x  N  constant  matrix,  and  L  is  the  linear  operator  representing  the 
system.  Then 

Hetll — ♦  0  as  fc — *  oo  (2.2C) 

if  and  only  if 

(tr(P))2  >(.V-1)||P!||  (2.27) 

where  ||P||^.  and  trP  axe  the  Forbenious  norm  and  the  trace  of  P  respectively. 
Proof: 

From  (2.S)  we  have 


n 


5fc+i(*)*Ij(af-af+1)0i(O 

i=i 

(2.2S) 

By  letting 

£(*  +  l)  =  [a?-a*+1 . a* -a*+1]r 

(2.29) 

we  get 

e*+i(<)  =  £T(£  +  1)$ 

(2.30) 

Also  (2.29) 

can  be  written  as 

<£(&  +  1)  =  d  —  a(k  4-  1) 

(2.31) 

Now  using  (2.22)  and  (2.24)  we  get  from  (2.31) 

£(Jt  +  1)  =  d  -  PT,3(k)  -  1PT{d  -  o (k)) 

(2.32) 

or 

<f[k  +  !)  =  (/-  ~fPT)<£{k) 

(2.33) 

Without  loss  of  generality  it  is  assumed  that  the  basis  is  orthonormal  therfore  we 
conclude  using  the  definition  of  L-2  norm 


11 


v.yn' 


I«*ll-  £(v>?+l)  =  «£(*  + 1)11 

.*=1 


To  simplify  the  notation  let 


F  —  I  -  yP7 


(2.33).  (2.34)  and  (2.35)  imply  that 

11^+111  =  11^)11  (2.36) 

Consider  the  Forbenious  norm  of  the  matrix  F  which  is  equal  by  definition  to 


IIFIIf  =  ltr(FTF)]'/* 

then  it  can  be  shown  that  for  any  v,  w  in  Rn  we  have 


\Fv\\  <  ||F||f  ||i;| 


(2.37) 


(2.38) 


in  particular 


\\F^(k)\\  <  ||F||f  ||£(fc)||  (2.39) 

By  letting  ||F||f  =  P  and  by  choosing  7  such  that  0  <  p  <  1  (2.39)  implies  that 


0  <  ||efc|j  <  pfc||e0ji  — ►  0  as  k  — >oo 


(2.40) 


(2.41) 


In  this  case  the  algorithm  (2.24)  converges  in  the  sense  of  (2.41).  If  we  plug  (2.35) 
in  (2.37)  we  get 


tr(FTF )  =  (72|!P!|2f  -  27tr(P)  +  fr( /)] 


(2.42) 


For  tr(I)=  n  we  get 


tr(FTF)  =  [72||P!|2f  -  2~ftr(P)  +  n]  (2.43) 

Equation  (2.43)  is  quadratic  in  7.  There  will  be  always  an  a  for  which  (2.43)  is  <  1 
except  when 


(tr(P))2  <{n-  1)||P||f 

In  this  case  p  is  >  1  for  all  7  and  the  algorithm  (2.24)  diverges. 

2.3  Remarks  and  Discussion 


(2.44) 


.  tyVjlViY 


iCiWC? 


At  the  end  of  this  subsection  two  points  need  to  be  mentionned:  The  first 
point  is  that  as  in  the  case  of  C°-type  betterrment  it  can  be  shown  (using  the 
L2-representation  theorem)  that  if  the  transfer  function  of  the  causal  system  under 
test  is  proper,  the  algorithm  (2.24)  may  diverge.  The  second  point  is  that  in  this 
section  the  Forbenious  norm  was  chosen  mainly  because  it  is  relatively  easy  and 
inexpensive  (computer  time)  to  numerically  compute.  In  future  work  other  matrix 
norms  will  have  to  be  tested  and  their  relations  to  the  rate  of  convergence  of  the 
algorithm  should  be  determined. 


Example  2.1 

The  applicability  of  the  result  of  theorem  5  is  illustrated  in  this  example.  Con¬ 
sider  the  first  order  linear  time  invariant  system  described  by  the  following  transfer 
function: 


F(s)  =  100 


s  -f-  .89 
s  +  0.9 


For  easy  checking  with  analytic  result  let  the  desired  output  t jj(t)  be: 

yd{t)  =  3.464101  —  0.73205 
which  can  also  be  written  as  in  (2.6) 


J/d(l)  =  Ci<i>i{t)  -f  C2 02 ( 1 ) 


where 


c,  =  1 
c2  =  1 

and 

<P  i(0  =  1 

<h(*)  =  2^31  -  \/3 

are  two  Legendre  functions.  In  this  case  N  =  2  and 

[99.62  -.19' 

^  “  .19  99.94 

the  condition  (2.27)  is  satisfied  and  the  algorithm  (2.24)  converges  in  two  iterations 
as  it  is  shown  in  figure(2.1) 


3.0  Comparison  of  Learning  Algorithms 


*•>*-.*- 


•Ti 


In  this  part  a  comparison  between  the  2  methods  introduced  above  is  presented. 
Based  on  the  decomposition  technique  and  the  anology  with  the  C'-type  betterment 
an  extension  and  generalization  of  the  method  of  section  2  are  presented. 


3.1  An  Extended  Algorithm 

Let  us  now  use  the  decomposition  technique  on  the  C'-type  betterment.  Again 
using  the  same  notation  as  in  part  2  we  will  have  : 


yd(t)  =  dT  $ 

(3.1 

'Jk(t)  =  aT(k)$ 

( 3.2 

u{k)  =  JT(k)$ 

(3.3 

From  the  C'-type  betterment  formula  we  get 

ttfc+ 1(*)  =  « kit)  +  6~{yd(t)  -  yt(0} 

(3.4 

or 

3T(k  +  1)$  =  JT(k)$  +  S(dT  - 

i  3.5 

Let 


.  n 

<t>i  (t)  =  ^2 

i=i 


0n(O  = 

1=1 

or  in  vector  form 


’  0i (t)' 

'a  n 

aI2 

•  ain  ' 

,0n  (0. 

-  1 

an2 

^nn  - 

therefore 


(3.G) 


(3.7) 


$  =  A*  (3.S) 

Using  (3.S)  in  equation  (3.5)  we  get 

JT(k  +  1)  =  Jr(l-)1>  +d[dT  -  &T(k))A*  (3.9) 

Let  P  be  the  matrix  of  the  operator  L  in  the  basis  [o, ( t )]  i.e. 


14 


L(<P)  = 


therefore 


a(k)  =  PT3{k)  (3.10) 

Now  we  have 

3T(k  +  1)  =  ir(A;)$  +  6(dT  -  3T(k)P)A$ 


=  {$T(k)[I  -  6PA]  +  6dT A)$  (3.11) 

3(k  +  1)  =  [I  -6ATPT}.3(k)  +  6ATd  (3.12) 

If  we  let 

PA  =  R  (3.13) 

ATd  =  V  (3.14) 

then  equation  (3.12)  becomes 

0(k  +  l)  =  [I  ~6RT]§{k)  +  6V  (3.15) 


Equation  (3.15)  and  (2.24)  are  very  similar  therefore  we  can  conclude  that  the 
C°-type  betterment  and  C'-type  betterment  are  just  two  different  ways  of  writing 
the  limitimg  case  of  algorithm  (2.24).  On  the  other  hand,  one  should  notice  that 
equation  (3.12)  can  be  also  arranged  in  the  following  way.  Let 

M  =  6AT  (3.10) 

then  using  (3.16)  we  can  write  (3.12)  as 

3(k  +  1)  =  (/  -  MPT)3(k)  +  Md  (3.17) 

This  important  result  is  generalized  and  discussed  in  the  next  subsection. 


3.2  Generalization  Of  The  Algorithm 

The  method  of  section  2  and  the  extended  algorithm  of  subsection  3.1  (equation 
3.17))  can  be  combined  in  the  following  way.  Assume  (like  in  the  case  of  mixed  type 
betterment)  that  the  e  -approximation  of  the  k  +  1  input  ujt+i(f)  is  given  by  the 
following  iteration: 


(3. IS) 


let 


3r(k  +  1)  =  r  (*■)$  +  -id  -  or(A))$  +  b\dT  -  ar(A'))'i> 

L(<P)  = 


therefore 


1 3.10 ! 

■3.20) 


o_(A-)  =  PT  3{k) 

which  gives 


3T(k  +  1)  =  3T(k)<t>  +  y(dT  -  3T(k)P)$  +  6(dr  -  3T(k)P)A*  ( 3.21 ' 

3T(k  +  1)  =  {3T(k)[I  -  -  P  -  SPA]  +  dT[~fI  +  d.4]}$  i3.22', 

which  implies  that 

3(k  +  1)  =  [I  -  yPT  -  6AT PT}3(k)  +  [yl  +  6AT}d  (3.231 

Let 


Q  =  1I  +  6At  >  3.24 ) 

then  we  can  arrange  (3.23)  as 

£(*  +  1)  =  [I  -QPT]d(k)  +  Qd  (3.23) 

where  Q  is  a  constant  n  x  n  gain  matrix.  Equation  (3.25)  can  be  therefore 
considered  as  a  generalization  of  equation  (3.17)  and  equation  (2.23) 

The  convergence  of  tha  above  algorithm  is  given  by  the  following  theorem 

Theorem  6.  For  the  problem  considered  in  theorem  5  and  a  given  desired  trajcc- 
tory  yd{t)  defined  over  the  time  interval  [<1,(2],  and  for  a  given  set  of  orthogonal 
functions  defined  by  the  vector  valued  function  $  if  there  exists  a  constant  X  x  X 
matrix  Q  which,  satisfies  the  following  condition 

\\i-qpt\\f  <  1 


then 

||efc||  — *  0  as  k 


Proof: 

In  the  previous  subsection  we  defined 


00 


ek+i  (t)  =  £T(k  +  !)<!> 


where 


(3.2G) 


16 


■yZi  k  4-  1 )  —  d  —  &(  k  -L-  1 ) 

=  d-  PT  3{  k  +  I! 


using  iteration  formula  (3.25)  we  get 


rdk  +  1)  =  r-(  k )  -  PrQ^k) 


r-(k  +  1)  =  (I  -  PTQ)r<  k) 


X(  k  •+■  1 )  —  Fy ~\  k ) 


where  F  is  a  n  x  n  matrix  defined  bv 


F  =  I  -  PtQ 


(3.30i 


If  we  use  the  Forbenious  norm  for  the  matrix  F  and  the  L2-norm  for  the  vectors 
£{k  +  1)  and  £( k)  then  the  inequality  (2.30)  implies  that 


>(*+)!!  <l|F||f|M*)ll 


i 3.32 ) 


then  by  choosing  Q  such  that 


we  have 


||F||f  =  p  >  0 


0  <  P  <  1 


|  e  fc  [  [  — *  0  as  k  — ►  oc 


and  the  convergence  is  assured.  From  (3.25)  it  is  clear  that  if  P_l  exists  then  the 
best  choice  will  be  Q  =  P~T .  In  this  case  we  have  J( k )  =  P~rd  for  all  k  >  0  and 
the  algorithm  converges  in  one  step. 


Example  3.1 

Consider  the  following  system: 


x  =  .Ax  +  Bu 
V  =  Cx 


•■V.V.V.V.V.V.  '-V.V.V.V.V.V 


4  = 

-1  —2 

and  the  desired  output  is 


B=l: 


c  =  n  o 


yd[t)  =  12#2(  1  -t) 

for  t  in  [0.  1].  For  this  system  the  condition  (2.27''  of  theorem  5  is  not  satisfied, 
however  by  choosing  a  set  of  4  Legendre  functions  with  a  matrix  Q 


-100.58 

177.04 

-100.03 

190.S0 

-170.70 

2S0.0 

-272.10 

302.15 

-105.  SO 

270.66 

-304.80 

311.0 

-190.70 

299.10 

-30S.60 

230.50 

for  which  the  condition  of  theorem  6  is  satisfied.  t/*(?)  conx’erges  to  yd(t )  in  4  it¬ 
erations  as  it  is  shown  in  figure  (3.1).  We  notice  also  that  since  in  this  example 
CB  =  0.  the  conditions  for  the  convergence  of  the  C'-type  betterment  as  has  been 
shown  in  [3].  [4]  are  not  satisfied.  Figure  (3.2)  shows  the  divergence  of  y^.(f)  from 
the  desired  trajectory  yd(t)  as  the  number  of  iteration  k  gets  larger.  However,  for  a 
system  with  the  same  matrices  .4  and  B  as  the  system  considered  in  this  example 
but  with  matrix  C  equal  to  : 

C  =  (0  l] 

the  C’-type  betterment  converges  as  it  is  shown  in  figure  (1.1). 


3.4  Remarks  And  Discussion 

In  this  part  an  iterative  method  which  can  be  considered  as  the  generalized 
version  of  the  algorithm  (2.24)  was  developed.  This  method  is  also  based  on  the 
assumption  that  the  system  should  be  linear  time  invariant  and  continuous  and 
that  the  desired  output  £/<*(()  should  be  given  on  the  entire  interval  [0,  Tj.  The 
Convergence  of  the  algorithm,  then  depends  only  on  the  possibility  of  finding  a 
matrix  Q  such  that  the  Forbenious  norm  of  the  matrix  F  is  less  than  unity.  As  far 
as  the  numerical  considerations  are  concerned  the  algorithm  developed  in  this  part 
is  easy  to  implement.  Except  for  the  decomposition  of  the  desired  output  yd{t), 
there  is  no  integration  or  derivation  to  be  computed.  Furthermore,  this  algorithm 
will  converge  in  one  step  if  the  inverse  of  the  matrix  P  can  be  numerically  computed. 
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On  the  Design  of  Time-Varying 
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Abstract: 

It  is  well  known  that  in  linear  time- 
invariant  design,  response  speed  and  accuracy 
tend  to  be  conflicting  requirements  that  cannot 
be  achieved  simultaneously.  For  example,  fast 
response  always  results  in  a  large  overshoot. 
Fast  response  without  overshoot  is  desirable  in 
many  applications  such  as  robot  manipulator 
control.  In  this  paper,  it  will  be  demonstrated 
that  the  use  of  time-varying  controller  can 
produce  a  response  with  short  rise  time,  no 
overshoot  and  short  settling  time  that  are 
impossible  to  attain  with  conventional  time- 
invariant  design.  Methods  of  designing  the 
time-varying  controller  will  be  discussed.  In 
particular,  the  method  of  dynamic  pole  assign¬ 
ment  and  the  use  of  piecewise-constant  time- 
varying  feedback  gain  implementation  will  be 
emphasized. 


I 


I.  Introduction 

In  linear  time-invariant  system  design,  the 
performance  requirements  of  fast  response  and 
good  accuracy  tend  to  conflict  with  each  other. 
So  compromises  often  have  to  be  made.  To  ensure 
overall  satisfactory  performances,  time-varying 
controller  that  adapts  itself  to  yield  optimal 
performances  is  needed.  In  this  paper  it  will 
be  demonstrated  that  the  use  of  time-varying 
controller  can  produce  a  desired  response  with 
short  rise  time,  no  overshoot  and  short  settling 
time,  which  are  Impossible  to  attain  with  con¬ 
ventional  time-invariant  design.  It  is  whown 
that  with  time-varying  controller,  not  only  the 
overall  performances  are  improved,  but  the 
system  stability  requirement  is  also  greatly 
relaxed.  In  contrast  to  the  linear  time- 
invariant  design  which  requires  all  eigenvalues 
(or  the  poles)  of  the  system  to  be  in  the  open 
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left  half  complex  plane,  the  time-varying  control¬ 
ler  may  allow  the  overall  system  eigenvalues  to 
be  moving  in  the  complex  plane,  even  in  the  onen 
right  half  complex  plane  for  some  intervals  of 
time,  provided  that  at  the  end  the  eigenvalues 
of  the  systems  are  placed  at  some  desired  steady- 
state  pole  locations  in  the  open  left  half 
complex  plane  to  insure  the  overall  system  stabil 
ity.  The  idea  of  allowing  the  poles  (the  eigen¬ 
values)  to  move  around  makes  it  possible  to 
achieve  a  response  with  snort  rise  time,  no 
overshoot  and  short  settling  time,  which  are 
impossible  to  attain  in  time-invariant  design. 
Several  methods  of  designing  the  time-varying 
controller  will  be  discussed.  A  new  concept  of 
employing  dynamic  pole  assign!-*""  «- ■' 

constant  time-varying  Yeeatac<  gjm  ror  tne 
controller  will  be  emphasized. 


II.  Controller  Design  Methods 

The  problem  considered  is  as  follows:  Given 
a  plant  with  transfer  function  G(s)  or  it  is 
defined  by  its  state  representation 

i(t)  *  A  k  (t)  +  BuU)  (la) 

ijU)  =  C  x  (t)  _  (lb) 

where  x_(t)zRn,  u(t)cR  and  y(t)t:R.  Also  given  is 
a  desired  response  ty , ( c )  to  a  given  input.  The 
objective  is  to  design  a  controller  so  that  »,(.*' 
can  be  achieved.  'a 

If  the  requirements  for  uAt)  are  such  as 
the  rise  time,  the  overshoot  add  the  settling 
time  are  not  very  stringent,  then  conventional 
lead-lag  compensator  (1)  or  the  state  feedback  _ 
with  constant  gain  matrix  [2)  will  be  adequate. 
However,  if  yAt)  is  required  to  have  very  fast 
response  without  overshoot,  for  exampl®,  iy,(t) 

Js  to  approximate.  g^(t)  *  1  -  exp(-t‘),  then 
one  has  to  resort  to  dither  time-varying  or  non¬ 
linear  controller.  Unfortunately,  the  stability 
of  linear  time  varying  system  [31  or  the  non¬ 
linear  systems  [4]  are  very  difficult  to  assure. 
In  this  paper,  several  controller  desiqn  methods 

that  will  assure  stability  will  be  discussed.  _ 

One  particular  method  which  uses  piecewise 

constant  time-varying  state  feedback  will  be  _ _ 

emphasized. 
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(I)  Optimal  Control  Tracking  Problem  Approach  iS] 
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The  optimal  control  theory  can  be  used  to 
design  the  controller  so  that  the  output  of  the 
system  will  track  the  desired  response  u^(t). 
The  controller  is  to  be  designed  according  to 
some  chosen  performance  index.  In  such  design, 
the  controller  turns  out  to  be  a  time-varying 
controller  which  requires  the  solution  of  a 
matrix  Riccati  operation.  Hence  the  controller 
is  difficult  to  implement.  Furthermore,  the 
choice  of  weighting  matrices  in  the  performance 
index  for  guaranteeing  the  desired  response  is 
not  transparent. 


(II)  Multi-Segment  Decomposition  of  Desired 
Response  Approach 

A  time-invariant  system  may  not  produce  the 
desired  response  over  the  entire  interval  of 
operation.  However,  by  decomposing  the  desired 
response  into  several  finite  subintervals,  say  N, 
itis  possible  to  approximate  the  desired  response 
by  a  linear  time-invariant  system  over  each 
interval  of  interest.  Although  the  system 
appears  to  be  linear  time-invariant  over  a  given 
Subinterval,  it  is  time-varying  and  nonlinear 
over  the  entire  course  of  operation.  Under  the 
mul ti -segment  decomposition  approach,  two  con¬ 
troller  design  methods  are  considered: 

(a)  Direct  Control  Law  Generation  Method 

From  the  desired  response  ;/,(t)  over  the 
interval  f-[t.,  f  .r.  I,  I,  .  .  . N-l,  one 

determines  thfc  co^-espondi ng  desired  state  jc(f.) 
and  x{  C  •  .'.  The  control  law  U(c)  for  te[t(.,C^,! 
that  will' transfer  x(f.j  to  x(t-  .)  is  generated 
as  follows: 

ultl  •  -8T  exp iA  IQ*1  t  ,t  • 

—  <■  4.  4.  <.♦  I 

<x'(c  -  expi  a  t  -t  ; 1  xf.t  , :  k  (2) 

tc[t  ,t  .) 

c  <  .  ♦  I 

"here  t  .  At--:  .  A  t  -- 

Q!  t  ,r  t*  e  *•  “•  SB  el  1  i* 

L 

controller  aqa^n  a  ti"*- varyi^q  :ontr^!?er 
*ni  :o  ’  $  to  •■xlen«nt.  )ne  •nodi*,ei 

approacn  is  to  *or*  th  tne  ji scret- :ed  plant 
and  *o  generate  a  »eouer>ce  of  lucrete  control 
'a*s.  In  tvs  :aset  a  "'u '  ti -rate  sanolm.q  *viv 
na^e  *o  oe  erpioye-i,  Jepenqmq  or  *:ne  :*a-Kte'- 
ist’cs  o#  **e  deseed  '■•sconce  will  :aM 

for  ~ o n - *0 u a '  ’enqtn  )*  segments  in  tne  deco^oosi- 
t4cn  IPDrnacn  is  not  /erv  Stra»qnt 
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a  sequence  of  piecewise  constant  gain  matrices 
which  can  be  easily  designed.  The  stability 
assurance  and  gain  computations  will  be  given  in 
Section  III.  A  design  example  that  illustrates 
the  procedures  is  given  in  Section  IV. 


III.  Key  Results  for  Piecewise  Constant 
Time-Varying  Controller  Design 

Theorem  1.  (Stability  Assurance) 


Consider  the  system  (1).  Assume  that  (A, 3) 
is  completely  controllable.  Then  there  exists 
a  set  of  constant  matrices  K . ,  c-0,  1,  2,  .  .  N-l 
such  that  with  the  state  feedback  u(t!  *  wit)  - 
Kit)  x  It),  where 


K(  t]  ■ 


K.  tc [f  ■ , t  ■  . )  i'O, 
<.  <.’  of  ' 


t  K, 


N-l 


t>t„ 


.N-l. 

(A) 


the  eigenvalues  of  Alt)  *  A  -  “Kit!  can  be 
arbitrarily  assigned  over  each  interval  tc[t., f.  .). 
Furthermore,  if  all  eigenvalues  of  A.j,.  A  -l  3^„  ( 
have  negative  real  parts,  then  the  system  is  stable. 


Proof 


The  pole  assignment  part  is  well  known 
because  (A, 3)  is  completely  controllable.  So  only 
the  stability  part  will  be  proved. 

For  any  bounded  input  v 1 t)  we  have  for 


A  . '  1. 1 . 1  ft  A  .it--! 

<!t)»e  4  1  xltJ*  (  e  c  3v(  t !  d:  (5) 

•*  <. 

where  A  *  A  -  3K.,  i’C,  I,  2,  .  .N-2,  and  A.  is 
not  required  to  b&  stable. 


Since  the  linear  system  can  not  have  finite 


escape  time,  hence 
Now  for 


x  :  t!  *e 


N-l 


t-tu 


it!  ■  =  for  al  1  Q<t- 


11  v  I  A  > 

i.  "N 


! 

By  I  T  '  dt 


(6) 


Clearly  '  x ' f '  1  ' •'»  t  f'f,  bqcause  A'r|*A,  ,  for 
r-rN  and  all  eigenval ues ’of  A  .  nave  reaT‘parts. 
Hence  x,r  '»  vt'O  and  the  system  is  stable.  — 


Conxnent : 

’he  only  requirement  for  assuring  the 
Stability  IS  that  Alt!  *  A,(_ ,  for  t  >  tj.  and  A, 
be  stable.  Ac's  for  i-N-T  ire  not  required  to' 
be  stable.  This  flexibility  permits  some  >o 
De  .nstable,  particularly  in  the  initial  phase  of 
operation,  for  fast  response.  ~ 


Lenina  1  (Bass-Guru  formula! 

Consider  tne  system  d).  uet  f f s  :  Pe  the 
characteristic  polynominal  of  A  given  by 
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V  -A-  ,.1-  . 


.  .  v  /.-Vv' 

•  -"v  -  -**  ■ 


d(%)  -  s”  + 


...  *  j. 


Let  i(s)  be  the  desired  characteristic  poly- 
nominal  for  the  system  with  the  state  feedback 
uUI  -  vUl  -  Kxjt).  let  i(s)  be  given  by 

a(s)  =  Srt  +  a1s'1*1  +  .  an  (8) 

If  the  system  (1)  is  completely  controllable, 
then  the  state  feedback  gain  K  always  exists  and 
K  can  be  computed  by 

K  -  (J  -  aVV  (9) 


i  *  a2 . V 

a  =  ,,,•>$  *  ) 

S  is  a  upper  triangular  Toeplitz  matrix 
with  first  row  (1.  ^ »  ^2>  •  •  •  a„_])  And 

Q  is  the  controllability  matrix  of  (1) 
given  by 

Q  «  (B,  AB . A^B) 

The  proof  can  be  found  in  [6],  hence  it  is  omitted. 


Theorem  2  (Piecewise  Constant  Controller  Design) 

Consider  the  system  (1).  For  a  given  input 
v(C)  let  the  desired  response  (/,(£)  be  partitioned 
into  N  finite  segments  so  that  :?,(£)  for 

be  characteri zed  by  a  set  of  desired 
eigenvalues  v<ij' ,  i-0,  1,  2,  .  .N-l,  j*l,  2,  . 

.  .  n  and  Re(v„  ,,j)<0  for  all  j=*l,  2,  .  .  .n. 
Then  there  exists  a  piecewise  constant  time- 
varying  state  feedback  K(-t),  as  defined  in  (4), 
that  will  produce  a  desired  stable  response 
Furthermore,  K^  in  (4)  can  be  computed  from 

- Kl  *  (al  '  S'V1  ■i'°>  ’’  z-  •  •N*1  (50> 

where  xT  3  (a..,  a..,  .  .  .  ,  a.  )  is  the  now 
vectorTfssociited  Afth  the  characteristic  poly- 
nomina?  -  *  “ 

n 

ot.(s)  3  srt  +  *.tsn-  +.  .  .♦a-*  *  (s-Yiy){H) 

■C  -Cl  -Cll  y.1 

and  a  ,  S,  and  Q  are  as  defined  in  Lenta  1. 


Theorem  2  follows  from  combinations  of 
Theorem  1  and  Lenina  1.  ..  .  _ 


IV.  Design  Example 

2  Consider  a  plant  with  transfer  function  G(s)* 
1/s  .  Suppose  the  desired  response  u.(CI  to  a 
unit  step  input  is  as  shown  in  Fig.  IT 


<S-W  cwgH.-*!.  ,o  me  a«.  ** 

h.11,  log,!!.  ..Pl^‘“=u« 


Such  a  desired  response  can  be  described  by 
y(TI  ♦  2  ;!£)  y  (cl  *  alt  I  »  1  (12 

with  the  damping  ratio  ;'C)  given  by 


-1.0  Cc (0,  0.7) 


0.5  Cc [0.7,  1.3) 

4  £*1.3 


or  equivalently  by  the  desired  eigenvalues 

<V'  V  =  ”'n 


(v,,,  Yj2)  , 


1-0. 5  :  j  0.S66I 


(Yj,,  y2:)  -  (-0.13,  -7.S7)  (14) 

It  can  easily  be  seen  that  for  the  implemen¬ 
tation  shown  in  Fig.  2  the  piecewise  constant 
time-varying  controller  gain  Kltl  is  given  by 

|  K0  *  (I,  2)  Ti(0,  0.7) 

KIC)  * f  K.  =  (1,  1)  Ce[0.7,  1.3) 

I 

{  <2  *  (1,  3)  £>1.3  (15) 

The  pole  movement  in  the  complex  plane  is  shown 
in  Fig.  3 
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feedback  linearization  theory  >n  differential  geo¬ 
metric  control  theory  1$  used  m  conjunction  men  time- 
varying  state  feedoack  to  design  the  time-varying  control¬ 
ler  for  a  multi-joint  rooot  manipulator  control  A  3-joint 
moot  manipulator  is  used  as  an  examole  to  evaluate  the 
controller  performances.  Simulation  results  snow  great 
premise  of  employing  time-varying  controller  to  achieve 
Setter  dynamic  control  over  that  of  time-invariant  control¬ 
ler. 


I.  Introduction 

tmoelled  oy  concerns  aoout  productivity,  worker 
shortaqe.  ana  hostile  environments,  interest  in  roootics 
has  grown  explosively  in  the  last  several  years.  Most 
Industrial  moots  are  basically  computer  controlled  mechan¬ 
ical  manipulators  wnich  can  be  programmed  to  perform  Che 
Casts,  with  minimum  or  no  human  intervention,  of  arc  weld¬ 
ing,  paint  spraying,  assemoly,  foundry  ooeration,  etc.  A 
manipulator  consists  of  a  series  of  lints  wnicn  are  con¬ 
nected  at  joints.  Typical ly  they  nave  three  to  sit  joints 
(three  to  sit  degrees  of  freedom)  with  a  griooer  or  end 
effector.  The  Joint  can  so  either  a  mvolute  joint  -or 
rotational  motion  or  a  orismatic  Joint  for  translational 
motion.  Each  joint  is  driven  oy  an  actuator  wnich  is 
commenced  oy  the  controller. 

The  motion  of  the  robot  manipulator  is  desired  to  be 
fast,  smooth,  and  accurate.  The  ooen-iooo  control  method 
is  not  sacisfatory  because  of  variable  inertia,  gravity 
loading  and  load  dlsturoances.  Therefore  the  dynamic  con¬ 
trol  of  the  ropot  manipulator  is  often  of  closeo-'ooo 
control.  The  design  of  controller  for  the  servo  system 
deoenas  on  the  dynamical  model  of  the  system  to  be  control¬ 
led.  "he  dynamic  eauations  of  a  robot  manipulator  can  oe 
obtained  either  by  Lagrangian  formulation  or  the  Newcon- 
Euler  formulation  [1-3].  These  dynamic  eauations  are 
highly  nonlinear  ana  strongly  couoled  differential  eoua- 
tions.  In  the  past,  the  controller  design  is  mostly  based 
on  independent  joint  control  [1-2],  I  Ineariratlon  and  feed¬ 
forward  compensation  [3-*].  One  best  xnown  examole  of  suen 
approaches  is  the  'comouted  toroue'  technique  [3]  or  the 
'inverse  problem'  technigue  [5],  These  technigues  reouire 
either  the  manipulator  lines  being  weekly  couoled,  or  eacn 
link  being  controlled  one  at  a  time  and  precise  Knowledge 
of  robot  manipulator  dynamics  be  known.  Such  taenmaues 
become  less  effective  for  nigh  speed  and  accurate  control. 
It  also  often  requires  excessively  large  control  torques 
or  forces  from  the  actuators. 

decent  advances  In  differential  geometric  control 
theory  [6-9]  provide  necessary  and  sufficient  conditions 
for  transforming  a  given  nonlinear  system  around  a  given 
point  Into  a  local  feedback-eoulvalent  linear  system. 

These  results  are  found  to  be  particularly  useful  to  robot¬ 
ics  aoollcations  because  the  kronecker  indices  for  such 
systems  are  mostly  less  than  or  equal  to  3.  With  orooer 
coordinate  transformation  and  feedback  transformation,  the 
nignly  nonlinear  and  couoled  roeotlc  equations  can  oe 
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transformed  -nto  a  set  bf  oecovoieo  secona-orqe-  -ear 
dynamical  ecuacions.  Jpnseauent! /,  -acous  ro-t-o  er 
design  technicues  'or  *near  systems  become  ido  cap  e. 

'he  use  or  feeupacx  '< near- tat i on  aporoacn  to  oes’gn  -boot 
controller  are  -eportea  to  give  /er  y  good  smulaf  m 
results  [7,  3,  10], 

In  this  oaoer.  ,«  snail  '1-st  snow  row  the  'eeobacx 
1 1  neari zati on  teenmaue  can  be  used  to  transform  n’gniy 
nonlinear  ana  coupled  -oootic  eauations  into  a  sec  bf  de¬ 
coupled  second-order  linear  system  eauations.  Then  con¬ 
troller  design  oasea  on  time-varying  stace  feedback  ml' 
be  discussed,  final  ly,  computer  simulations  "’or  a  3-jomt 
robot  manipulator  control  are  presented.  The  simulation 
results  snowed  clearly  tne  potential  of  a  t'me-vary-g  con¬ 
troller  for  achieving  -aster  ana  accurate  response  men 
smaller  control  efforts  tn an  a  conventional  time- 1 nvar •  an t 
control ler. 


II.  Main  Jesuits 

Definition  1.  kronecker  Indices 

The  kronecker  -nalces  'or  a  matr'x  pair  A,  j)  are 
defined  as  ‘oilows:  vet  3,  and  i,  be  jef‘"ea  as: 

3;  =  [8.  A8.  .  .  a"’  3] 

b 

tj  •  dim*.  -  dim9i_i 

wnere  i  *  l,  2.  ...  are  integers,  dim*,  is  ;ne  dimension  of 
3,  and  dlm*^  a  0.  Then  'or  an  'nteger  ;,  the  kronecker 
index  k,  is  aeflned  to  oe  tne  totai  numoer  of  l,  -men  are 
greater  than  or  edual  to  J ,  '.a., 

kj  *  {  J) 

Definition  2.  Invo I utl veness 

Let  CM*'  .  <*'•  be  a  set  of  smooth  vector 

fields  in  3n  With  l'i|5),...,  X^’j)  being  'Inearly  ndeoen- 

dent  for  some  point  p.  Then  the  set  C  is  saia  to  be  'n- 

volutlve.  If  for  any  X^cC,  there  exists  smooth,  -eal 

valued  'unctions  (o'-(k)if,  suen  tnet  tne  vie  product 
_  m  - 


(',X^](x|  »  t  x ) x'.'xj  'or  k  m  the  nei 


gnbornood  bf  0 


Theorem  1  Mu  1 1' - Input  feedback  linear'  :at*on  ’’•eorem 

Considar  the  multl-'nout  nonlinear  syscem  of  the  'orm 
.  0  n  1 

t_  -  x  ; « i  ♦  :  u,  (x  ix  ;  1 1  ( i ) 

i  *  i 

where  ic3n.  X^(i|  ano  X 1 ( * ) ,  i  *  I,  2 .  n,  are  smoot- 


vector  Melds  t  3  and  u(I«l,  i  *  I,  3.. 


valued  smootn  scalar  functions,  cat  j  oe  the  oo’nt  jf 


Ki 


s 


SX 


•nterest  -‘or  ' i near  * :ac i on  .  '**  ~ecessary  ana  suf'*c*ent 
conai;'om  'or  tne  system  (M  *»ith  «ronecxer  ncicts  ».  ; 
<-,  i...l  < ^  to  0*  'eeabacx  eou» valent  to  tne  -nur  t*me- 
invar’int  :iromcii  'orm 

*  =  *v  *  5u  2 ; 

«n#r@  »  ^  i  ■>  ,  , 

I  4  ■*  - 

M  3  •  -  :  i 

A  =  3iag(A,.A, .  a  }  *»tn  •»  =  j . i 

:  3  :  :  ■  | 

1  3  3  3  -  3  i  < 


mo  3  =  Tb. . b^J  *itb  b  .  0 ,  3 .  '.3 .  3]' 

3ting  in  i  x  i  vtctar  .itn  *  *n  tnt  :  <  :3ffioon.«t  ino 

zeros  eisewntre. 


( i  )  "here  exist  -ex)  (a,}',  sucn  :nit 


i,(  .3i  -  3 

(ii)  dirtiCljl  *  n  (loci)  control laoi  1  i ty  ceati,  «ntr«  C  s 
1  31  n  *  n-* 

C  *  ;x  ,  Uax-.x1),  ...  da  r.x  t*.  aexJ.<-\ 

<,-2  ’  •>  n  i  <  -  ■  , 

_ (aa  -  X".X';;...;  X  ,UaX  .X'1: _ *  aa  ’  r'.An;; 

-itn  :aa3x.')»y,  (aaX.r )*fx,r], • • . :aa‘x. < )*{< . .aa'" x , f ] , 
being  tne  Li*  oroduct  notations  £X.f3*<  '.xjffxi-'  '<jx(xi 
and  X<t  are  cn*  Jacooian  of  X  and  f  with  respect  to  x. 

(Ill)  for  eternal,  2 . the  set  C,  defined  iS 

--  Ci  =  <x  .UaxJ..<‘) . (ad  C,A*U  r.  UaxJ.x~',t 

<  *2  <  •i 

....  {a a  ’  X3,X2);...;  x^.  ( adX° , xf’ 5 .  (aa  1  x3 .x?’ ; } 

is  involucive. 

Tha  oroof  far  thaoreei  1  can  ba  found  *n  f®.  33. 
Theorem  2.  (faaaoacx  Linearization  of  ?oootic  iouation) 

Canjiaar  tne  oynamic  equation  of  a  n-;oint  -ooot 
mamoulator,  given  oy 

0(3)3  *  H<a.  a)  *  3(a3  *  ■  (3) 

whara  3*(o^ , O, . an)C  **  -0,nt  aosttion  vector, 

0(3)  u  an  n  x  n  matrix  reoresentlmj  effectiva  ind  couoled 

Inertia,  h(3,  3)  is  an  n  x  1  victor  r,0resanting  ;na  Can- 

trisatal  and  Coriolis  tarns  ana  £(3)  is  an  n  x  1  vactor 

reoresantlng  tni  gravity  loading  tarns,  lat  3,»x,(_.  ana 

q,=x,(,  >*l,  Z .  n  oe  cnostn  as  stata  virnolas  ana 

1  ci  --  Q 

rawita  (3)  as  m  (1),  whara  X  (x (xfxj, f , (x I .  x^,  f,;x|,.. 
...  *2n.  '„(£)]*  *na  x 1  ( * )=ro.  0 . 1,  0 . j]\ 

1*1,  2,  ....  n,  is  *n  2n  x  J  vector  witn  ?  in  tn*  21 


contains  a  nonlinear  comoination  of  some  torques  or  forces 
t ^  and  f<(x)  Is  a  smooth  real  'unction  of  <  'or  *11  1. 

Let  o  3e  tne  eouilibriuoi  point  of  (1)  for  jf(xi-0  'or  ail  i. 
Then  :n*  coordinate  transformation  ^*x  and  :n«  feedOacx 

transformation  j .  (x  )*-f  .  (x  ,  i*l,  2 . n  with  ji 

being  a  new  free  control,  will  transform  (I)  *nto  j  'eed- 
bacx  eguivalent  linear  tlme-’nvan ant  canonical  *orm  ;2) 


A  Jo  ’] 

*1  10  3  J 


yi  »  i,  z . i 
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i  tn  Jn  x  ector  «*«v*  ^  c-e  J  ?  i-o  cs^os  e  ;i 


A»"30f  3 T  ’Heor*m  2. 

'•••sc.  -•*  snow  '.“it  *.*»  T-recxir  •“O'ces  'o'*  system 
;  '  ;  ror-esoonai  ng  ■  yt  irv  i  to  :r  ess  :“»n 
Since  ^,=-<  c  -as  <  o,:  S.  2.  ...  2;' 

**itn  2*  ?'ement  oemq  '  ana  :»ros  i'Sewn«r®.  c’ear  y  t.= 

L  2dl**tf.=n.  -iQ*  z  joa"  t  c  .  acA^,  ^,  0  » 


Hence  3im»?,=2.n.  t  'o  i '  ows  trat  l:  2.  jno  l.=0  'or 

i  >  3.  'herefore  :n*  <ronecx*r  indices  ire  <.=2.  yis*.2. 
...  n,  and  < .  20  'or  i  >  n.  ^hen  ^  %  Z  1 1,  tn*  conc't’on 
(iii)  *  n  tneorem  :  *s  -moiieo  3y  conoid  on  ;m».  So  to 
estaolisn  'eeaoacit  eouivalenc*  **«  need  to  cntcx  aniy  :onc,»- 
tions  (i)  and  Mi)  m  tneorem  1.  Since  <-'£i=0.  condition 
(1)  *s  satisfied  witn  3,^0  Vi.  Since  i rmC.'o .» sdimR,*2n .  tne 
system  (1)  *s  'ocaily  controilaole  at  o.  So  cnd’tic n  ’M 
of  theorem  I  -s  satisfied,  (n  'act.  all  three  :ono*t:sns 
of  theorem  I  are  satisfied,  ’herefore  system  {']  can  oe 
mao*  to  o*  feeooacx  eouivaient  to  \Z).  *o re  exoi'C'tly,  *t 
can  oe  shown  that  the  coordinate  transformation  **t  ma  t“e 


will  transr orm  ■.  1  ) 


feeabacx  transformaci on  u.(xis-f  (x)*^^  - 
Into  (2)  with  A  and  o.  as  given  *n  i 4 ) . 


1.  Theorem  2  snows  that  «ith  'eeooacx  1 i near- cat* on ,  the 
hiqnly  nonlinear  and  strongly  ccuoied  nobotic  eauations  can 
be  transformeo  *nto  i  set  of  n  cecouO'ta  5*cona»*rcer 
linear  time- • nviri ant  canonical  *orm  «ith  couoie  ooies  it 
the  origin. 

2.  ’he  smoothness  -eouirement  on  vector  ,Jeirjs  n  : "  ' 
prevents  the  aooiicacion  of  'eecoacx  ’ inear »zat'on  to 
system  «*itn  violent  non  I  inear*  t?  e$  such  is  st'cdon  and 
coulomo  friction  wmcn  are  always  oresent  -n  actuators  at 
robot  ;omts.  Such  effects  can  oe  removed  'rqm  --e  jynamic 
model  and  be  comoensated  later  ay  feedforward  comoensati on . 


[II.  Controller  Design 

The  objective  of  the  rooot  mamoulator  cont-oi  •»  to 

move  the  mamouiator  'rom  a  given  joint  oos't^on 

,  to  the  cesired  joint  position  i, 

*.n-<  J  fcn*‘ 

Eduivalentl y,  via  the  coordinate  transform,  this  s  the 
aroolem  of  moving  'rcm  the  states  /, .  y?,  ...  .to 

desired  states  y..  y^ . y,  It  ’S  cesi-ioi*  to  ;o 


•  3  tnt  ctsirta  sci:«t  n  tnt  snorcesc  oossio't  w* trout 
: Jus  1*9  any  ovtrsnooc  2 r  oroaucmq  steady-state  error  n 
on#  '•tsoonst.  ;t  s  a  i  so  ’moorcant  to  k.hiv*  sucn  aojtc- 
VvM  tn  is  'itt'*  control  «f*arti  tnt  actuator  torcuts 
ar  'orcesj  is  cossiolt  3f  courst  t.i«  si-oi’City  *  n  ^oi** 
n*nc'nq  tit  control ’tr  ”•*  to  aiw*ys  :t  *toc  ‘n  una. 

Af*»r  tnt  -lOfll  '"eir  •■oootic  eauation  •  s  convtrtea  -oca  ts 
'••aoacx  eau»v*itnt  ’ *nt«r  canonical  'orm,  /ir’ous  control* 
'tr  ctsign  ctcnnicuas  ':r  '■  intar  systems  ctccmt  aooi'caolt. 
">t  convtntionai  constant  start  'ttesacx  cts'gn  '»nos  to 
jiv#  iiower  -tsccnsts  ;r  ‘.a  -tcuir*  ir;er  cont-ti  ***zr~. 5 
tnan  :  *  o  s  t  :»s i*»c.  :oc*-ai  *r--r  ::r-»cv-: 

er  *3’  'tcui^ts  tnt  exttnvvt  ::sci.:a:‘:»*s  :•  ft  -atr*t 
-♦coat*  ecuacron.  '..o  tms  oact ** .  «*t  cons'cer  ^se  of 
tint-varying  scat*  ‘ttcoacK  controi’tr  ar'  tnt  'or ’owing 

'or-*: 


-n*rt  end  <„..t;  art  -tsotcr * vt *  y  tnt  tf*- 

/aryng  zssit’.cn  *na  vtioctty  'ttcoac*  gams.  * .  s  tnt 
-fytr»nct  ‘ flOut  and  s  tnt  itw  '>tt  control  *i  tnt 
!fn#ar  canonical  *om. 

Jjirg  tnt  -tiations  n  caoramat#  transt’orwaci on  and 
'ttooacx  transformation,  tnt  control  'unction  a:.<)  *n  M 
■s  givtn  3y 

u,  'i'**',  '*>■**-«.! » ‘'21  •  »  ”  j**m  {6i 

•wier*  Vv«i  s  is  givtn  •«  rntort*  Z.  *"0  42‘-’ 

i»l.  2 . •»  art  -it  givtn  ard  ctsirta  joint  cosinons. 

*Tia  actuator  control  torcuts  3r  'orcts  Z.  . . 

i)  art  jotai rtd  :y  solving  'ar  tnt  ucs  of 

eouat'ons  u;  .  ' * 1 ,  2.  ....  v 

In  oractict.  ait  actuators  navt  tntir  -itaimun  Mmics 
on  t-tir  tont-oi  torcuts  or  'orcts.  id  -n  assigning  tnt 
:ontro i 1 tr .  tnt  constraint  i  'or  all  :  -s  •*- 

oostd.  Vim»-varying  'etooacs  gam  and 

will  ct  so  c.lostn  :o  najiimizt  :.nt  -•soonst  sotta  ana  tnt 
accuracy  witnout  causing  tnt  control  torcuts  or  'orcts  to 
excato  ts  '‘mtJ.  'o  stmtam  cr’ttcaliy  camotd  '•tsoonst 
'or  ail  t,  ’t  -s  -touirtd  tnat  t ; ’it . 

orcctdurrs  'or  Ctsignmg  tnt  t'mt— ury’ ng  control- 
’tr  art  suuftar-rtd  3t»ow: 

[nitiai  Ut  nouC  ' . 

2.  iorroutt 

3.  3tad  <*j„j  .  *-j.i . 

a  Cvflicut#  u,(*J  'ro  m  1 6 )  and  solvt  'or 
5.  :,n#c*  to  stt  ‘ f  'r,|  i  % 

h .  3astd  on  stto  3.  jodaca  *  - ,  an<i 

;  ront-oi  :yc!t  s  oont  -n#n  ’*  '•acred  ’ir 

al  1  ■  *  1  .  C.  -  **• 


3isc-ssicrs  :n  -oaac*-q  <«...  in-  *■  - . 

'‘'t  -oaatt  or  :c-t-vjryi*»c  jai^s  < -  .  ana  can  :t 

oont  3y  e*-o  i  o  y  ’  n  g  tnt  cynaitic  coie  “anagtn'tnt  tafnicj# 

’  :r  3t.-tr  .oaatmq  scntwts .  %»  .ccaf-g  :cne-t  -• 

jito  'or  simuiat’ons  to  :t  :ist-scea  -n  trt  -tat  stct-cn  s 
to  :suo i t  *ntn  *  -  tro  to  -educs  -V 

Mntfi  •-  ;  »  - .  .  Sucn  1  sen  trnt  'S  ewioiorta  or*"ar’  f  *  or 
■ts  east  ,fi  mo  i  emtntat'  on .  3ur  "iii  :o;tct*vt  '*fi  s  to 
ctmonstratt  tnt  :oc*nt*ai  ma  tnt  caoaoi'  ty  or  '.'•»* 

/arymg  control  er  *.a  /i«*j  **or?  sat*  s*  actor*/  'sssens*  tnan 
tnat  or  a  ■  nvar •  ant  :ontro:'er 

i a amo  1  *  and  !;*"«. at*  on  ;es«  .  ts 

A  3 - j o i n t  'ooot  •'amouiator  cncwn  •  n  *-g.  s  -saa 
as  an  examoit  to  evauata  tnt  ctr’ormances  :f  :cnt-o'*er 
Jtsign  conceots  ctscr’Otd  '»  tvs  oaoatr 

*ht  3-;omt  -aniouiator  ;r  r;g.  ’  consists  Of  :nt 
-evoiuta  joint  and  two  sr’smatts  joints,  “'it  '■ctiry  ;jin? 
rctatts  n  tnt  * *y  3. an*  joint  .ar*aois  :  i .  >t  or  tnt 
or'smatic  joints  allows  tnt  nanc  to  axttnc  •**  tnt  <-*/  o  •  ant 
jjjiflt  /ar*aolt  */  woii*  :nt  otntr  *cs  tnt  •'a no  t-sns.act 
*iong  tn*  »-axis  (joint  /ariaoit  :).  "nt  -ooette  eouat’.ons 
and  control Itr  otsign  oastd  :n  'ttooacK  i near  * citt on  ana 
tjmt-varymg  'ttooac^  gains  ar«  ;m  retd  ■,tr«  out  to  acx 
soact.  ”ht i r  stcai  Is  can  3t  ’tuna  n  *?3J.  'or  t."t  *o«- 
T owing  oaramtetrs:  ^  t*-n.  :  ?  *aOC?w  (or  W) 

, «»,  2.  3  'or  torcut  or  'orct  :{mits,  simutatta  t.lt 
jo*nt  nottons  'r-m  **<3ria.  to 

s«-0.5rad.  Hit  rtsuits  art  snown  -n  naoit  !  a^d 

figures  2.  3.  ana  i.  Titst  ntjyits  -ntn  co.ooar-a  -»tn 
tnost  j*i*q  comoutaa  torcut  taermeut'  or  -ttccacx  '  intar- 
’tatton  witn  constant  statt  'etcoacx  snow  -iucn  •■^crcvtmtnts 
in  convtrgtnct  ttm%,  sceaoy-stit#  accuracy  and  control 
tf forts.  (St#  ’adits  2  and  3.5 

Ac> "tw i eocr^tnt 

i j  -estaren  -as  suooorred  '*  :art  ty  tnt  2f4lz* 
of  Naval  5tstarcn  jndtr  iontract  iGCC I -;-C5 . 

Strtrtncts : 


?au i .  5  • 


>i : 


3raav.  *  .  J.  -oHtroacn. 
.ozano-*;*f‘»i  mo  ’•ison 

ana  ';-*•**■  ,  'ht  ^IT  3,*tss,  '99< 


L .  .fln«icn  , 


C1] 

CJI 


Jtjczy.  A.  -’ooot  Arm  dynamics  and  ront-oi,1*  Jtt 
Sroouision  ^aooratory.  ’ecn.  itoort  J3-oo9.  -to.  (73* 

/uaooratov  i  c .  *.  mo  3.  itsa’C.  Tcn^II  •  -y 


1  on  -ocots.  ior*ng«r  .'tr-aq.  itr^ 


o«ui.  3.  P  .  '*-oat'‘,,g.  ’-ijectory  Jaicu  at*:n  ina 
Jtrvo’ng  of  1  (emouttr  J;nt"Oi  ea  Arm.'  ltan*2ro 
Arv'*tiai  ntt  1  (  ’  gtnet  .acoracory,  itamoro  -my.. 
AJW  77.  97C. 
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innerm-’  y  : _ Three  joint  mar 

The  dynamic  equations  of 


can  oe  written  as: 


b, 


m. 


here 


2 


u  (x)  =  — 
J  b3 


r.  >  .  ; 


SysCem  (Al-^  can  be  expressed  as: 

x  =  X^(x)  +  X^(x)u.(x)  +  X-(x)'j_,(x)  *  X  2  ( x  )u  „  f  :< )  (Ai2! 

I  -  J 

where 


X°  - 


-»2  ' 

‘o' 

V 

r\ 

J 

f .  (x) 

1 

0 

0 

:<4 

l 

0 

0 

ft 

0 

,  X*  = 

,  x-  = 

t 

,  A  = 

f^Cx) 

0 

\ 

r\ 

J 

:<6 

0 

0 

0 

-  0  . 

_0_ 

ft 

> 

Uj(x),  u^Cx),  u^Cx)  are  Che  concrols  and  ?  =  0  is  the  equilibrium 
point  of  (A5  j ■ 

We  check  again  the  Kronecker  indices. 

2. L  =  dim  R1  =  dim  { X 1  ( P )  ,  X2(P)  ,  X 3 ( P )  }  =  3 

Let 

R2  =  (X1  (P) ,(adX°  .X1 )(?) ,X2(?) ,(adX°  ,X2)(P) ,X3(?) ,(adX°  , X 3 ) ( ? )  } 


$  l 


2  f—  -<  -2^,2 

S  <1  3  y'i 


(adX°,X2)(P)  = 


xlx2  3  x" 

-2b.  — ——  *  i-  b,  — 

2  C1  "  “  C1 


(adX°,X3)(P)  =  (0, 0, 0,0,1 ,0)' 


2  .2  1 

Hence  dm  R  =6  and  3  dim  R  -  dm  R  =  3  , 

4 


L.  =  0 

i 


i  >  3 


The  Xronecker  indices  are  given  bv: 


K1  =  K2  =  K3  =  2  ’ 


X.  =  0 

l 


Thus  conditions  1  and  3  of  the  feedback  theorem  are  satisfied. 


Also  C(P)  =  R~ .  Thus,  dim  C(?)  =  6,  which  makes  condition  3  true. 


System  (A5)  is  then  feedback  equivalent  to  the  following 


linear  canonical  form  '(A  1 3  i  : 


where  u ^ ,  u?  and  y ,  are  new  free  controls.  Again,  the  real 
valued  functions  h^,  h^  are  chosen  as: 

h  j (x)  =  x j 


h^x)  = 


h,(:<)  =  x . 
3  .  5 


which  leads  to  an  identity  coordinate  change.  The  feedback 
controls  u^ ,  u^  and  u^  are  given  by: 


UjCx)  =*  -  fj(x)  +  Kj  (wL  -x ^  )  -  K  Xj 


u2(x)  *  -  f^Cx)  +  K_j  ( —x 3 )  -  K^x^ 


u,(x)  3  K_(w  -x,)  -  K  x, 

J  J  J  J  0  0 


The  explicit  control  forces  F  F  and  torque  T  are  obtained 

R  z 


from  v  AS  )  ,  (All),  and  ( A  T  0 ) 


FR  =  bl^~  +  b3§  si-n  x3  lA  : 

T  =  c  Cx )  [  —  f.(x)  +  X_(v  -x,)  -  K,  x,  !  *  b,  zx, cos  x, 
12  333  «■  + '  j  '  i  3 

-  -j  b4g  cos 

F  =*  b,[K-(v„-x_)  -  K,x,  1  1 : 

2  o'*  5  o  o  5  o  - 

These  control  forces  F  ,  F  and  control  coroue  T, 

t\  2 

transform  the  nonlinear  system  (robot  model)  into  the  linear 

canonical  system  ( A 1 3 )  . 


J 
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ABSTRACT 

This  paper  describes  some  of  our  thoughts  on  future  control 
.resign  strategies.  The  controller  design  is  viewed  as  a  general 
decision  malting  process.  To  control  complex  systems  where 
precise  dynamical  models  are  not  available,  intelligent  control 
•nouid  be  used.  An  intelligent  controller  is  a  knowledge-based,  rule- 
based.  hierarchical  controller  with  variable  structure.  It  optimizes 
multiple  objecnves  with  constraints.  It  facilitates  its  decision  making 
by  control  enrichment  and  observation  enrichment.  It  has  memory 
and  is  capable  of  learning  from  its  experiences.  Heuristic  and 
cuaiitauve  rules  are  considered  to  be  more  important  than  quantitative 
and  deductive  methods  in  such  decision  making.  As  such,  input- 
output  tcause-effect)  characterization  of  dynamical  system  is 
emphasized.  Design  of  such  intelligent  controller  calls  for  the 
possible  mtegranon  of  artificial  intelligence  and  operational  research 
with  the  control  theory.  Finally,  it  is  suggested  that  integrated 
control  design,  which  considers  jointly  the  plant,  actuators,  sensors 
and  controller  at  the  outset  of  control  design,  should  replace  the 
current  approach  of  designing  a  controller  for  a  given  plant. 


I.  Introduction 

The  majority  of  control  design  theories  today  require  rather 
precise  information  on  the  parameters  of  the  plant  to  be  controlled. 
Most  of  the  controller  design  is  based  on  the  system  parameters  such 
as  poles,  zeros  or  eigenvalues.  In  the  case  of  linear  systems,  very 
often  we  begin  with  the  system  description  of  As  -t-  Bu,  y  *  Cs 
♦  Du  or  its  transfer  tunction  G(s).  The  desired  system  response  is 
characterized  in  terms  of  desired  pole  -  zero  pattern,  desired 
eigenvalues,  or  opumizauon  of  performance  index.  Controller  is 
ihen  designed  through  the  use  of  the  pole-zero  cancellation  [1-21. 
loop  shaping  12-31.  state  feedback  (4-0)  or  performance  index 
optimization  |7-8|.  In  almost  every  case,  the  precise  information  of 
the  plant  parameters,  such  as  poles  and  zeros  of  G(s)  or  the  matrices 
A.  B  and  C,  must  be  known  in  order  to  design  the  controller.  In  the 
case  when  plant  parameters  are  not  known,  system  identification  ( 5] 
must  be  done  first  before  the  controller  design  can  proceed.  In  the 
frequenacy-domatn  design,  pole-zero  information  on  the  plant  is 
needed  for  pole-zero  cancellation  and  loop  shaping.  In  the  time- 
domain  design,  matrices  A.  B  and  C  are  needed  for  checking  the 
controllability  and  observability  and  for  computing  the  gam  matrices 
for  state  feedback  and  state  estimator.  They  are  also  needed  in  the 
solution  of  matrix  Riccan  equation  in  the  optimal  control  design.  In 
other  words,  explicit  and  precise  information  of  system  parameters  is 
indispensable  in  most  currently  available  controller  design  methods 
In  practice,  many  control  systems,  such  as  robotic  systems  in 
manufacturing,  the  discrete-event  systems  in  production  lines,  the 
large  structure  control  in  space  and  the  socio-economic  systems,  etc. 
often  lack  precise  dynamical  models  due  to  complexity  of  the 
system,  high  degree  of  nonlinearity,  large  uncertainty,  large 
parameter  variations  and  time-varying  parameters.  It  is  becoming 
more  apparent  that  conventional  controller  design  theories  are  either 
inadequate  or  impractical  for  dealing  with  control  of  such  systems. 
Therefore,  new  thinking,  new  approaches,  and  possibly  new  tools 
will  be  needed  if  we  are  going  to  control  complex  systems 
effectively.  With  the  advent  ot  learning  theory  in  psychology, 
artificial  intelligence  in  computer  science,  operational  research  in 
optimization  and  the  generalization  of  control  theory  into  a  broader 


decision-making  theory,  a  new  field  of  intelligent  control  has 
emerged  (8-9).  This  profound  change  in  control  concept  undoubtlv 
will  drastically  change  our  future  control  design  strategies  In  this 
paper,  we  would  like  to  present  some  ot  our  ihoughts  on  control 
design  issues. 

II.  An  Inspiration 

Before  we  proceed  to  discuss  various  control  strategies,  let 
us  first  take  a  look  at  a  well  known  control  problem  in  our  daily  life, 
namely  driving  of  a  car.  In  car  driving,  the  human  is  the  controller 
In  fact,  it  is  an  intelligent  controller.  When  we  decide  to  go  from 
point  A  to  point  B  by  way  of  car,  we  first  find  out  when  we  have  to 
get  there  and  then  cry  to  drive  there  the  best  way  we  can.  This  may 
sound  easy.  However,  it  needs  the  best  driving  skill  from  an 
intelligent  driver.  When  we  say  "the  best  way  it  means  somewhat 
differently  from  what  is  conventionally  used  in  optimal  control 
theory.  To  go  from  A  to  B  in  "a  best  way"  may  mean  to  take  the 
shortest  path  to  get  there  in  minimum  time,  with  mirymum  energy 
expenditure  and  to  have  a  smooth  and  most  enjoyable  drive,  etc.  In 
other  words,  intelligent  control  often  has  to  deal  with  fuzzy  system 
requirements,  to  work  with  multiple  objectives  and  to  optimize 
performance  indices  which  may  not  be  optimal  in  strict  mathematical 
sense.  To  choose  the  shortest  path,  the  driver  must  know  how  many 
different  paths  are  available  and  their  respective  lengths.  To  drive 
there  in  minimum  nme.  he  needs  to  go  at  maximum  speed  with 
constraints  on  speed  limit  and  traffic  and  road  conditions.  To  drive 
with  minimum  energy  he  needs  good  driving  control  of  the  car.  So 
an  intelligent  controller  should  have  a  good  task  planning  if  it  is 
going  to  have  good  control  strategy.  To  have  good  task  planning,  it 
needs  good  knowledge  base  which  comes  from  instructions  or 
acquires  through  learning  experiences.  As  to  the  actual  control  of  the 
moving  vehicle,  a  good  control  law  generated  by  the  controller  ithe 
driver)  is  needed.  With  the  current  control  method,  we  would  have 
to  have  a  dynamical  model  with  known  parameters  for  the  car  we  are 
driving,  i.e.  some  differential  equanons  characterizing  the  dynamics 
of  the  car  on  the  road.  With  this  parameter  information,  we  will  then 
figure  out  the  gain  matrix  for  the  state  feedback  or  ihe  controller 
poles  and  zeros  so  that  a  satisfactory  control  law  can  be  generated. 
As  we  know,  when  we  drive  we  never  bother  to  find  out  what  are 
the  dynamical  equanons.  (the  state  equanons  or  the  transfer  funenoni 
we  are  working  with.  If  we  had  this  mformanon.  we  might  be  able 
to  drive  the  car  simply  with  the  gas  pedal  control.  If  this  were  the 
case,  then  very  few  of  us  will  be  able  to  be  a  sansfactory  driver 
i  because  the  control  strategies  will  be  too  complicated! ).  Of  course, 
ihe  car  was  not  designed  to  be  controlled  by  the  gas  pedal  alone 
(even  though  the  speed  and  direction  of  the  car  can  be  controlled  by 
the  gas  pedal  alone  if  we  know  the  dynamical  equations  which 
accounts  for  the  slope  or  curvature  of  the  road)  As  we  have  seen, 
the  single  input  control  can  be  very  ineffective  in  a  complex  control 
environment.  To  make  control  strategies  simple  and  effective,  we 
should  expand  our  means  of  controlling  ihe  system  In  the  case  of 
ihe  car,  we  incorporated  the  brake  and  the  sieenng  wheel  The  brake 
is  to  control  the  vehicle  speed  more  effectively  hv  providing  last 
deceleration,  and  for  deceleration  only.  The  sieenng  wheel  allows 
effective  change  of  vehicle  direction  without  the  need  of  resortine  to 
proper  acceleration  and  deceleration  which  require  complex 
computations  involving  ihe  dyanmical  equations  and  ihe 
disturbances.  The  above  observations  suggest  that  ettecove  control 
can  be  achieved  more  etfecnvelv  by  expanding  the  control  capabiluv 
of  the  system,  namely  through  the  control  ennehmem  strategy  Now 
let  us  turn  to  ihe  question  ot  decision  maxing  in  ihe  controller  To 


decide  when,  and  how  much  10  apply  to  the  gas  pedal,  the  brake  or 
the  steering  wheel,  the  driver  (human  controller)  just  like  in  many 
conventional  controllers,  makes  his  decision  based  on  his 
interpretanon  of  the  error  signals  (such  as  how  close  is  the  car  from 
the  side  lines  of  the  road,  how  close  is  the  car  in  the  front,  how 
much  is  the  speed,  how  curvy  is  the  road,  etc.).  In  the  interpretation 
of  error  signals,  the  driver  focuses  his  attention  first  on  the  polanty 
of  the  error  and  then  on  the  quantitative  aspects.  In  other  words,  the 
driver  focuses  primarily  on  qualitauve  aspects  ot  the  error  signals  to 
help  him  decide  whether  he  should  step  on  the  gas  pedal  tto  speed  it 
up)  or  on  the  brake  (to  slow  it  dowm;  whether  to  tum  to  the  right  or 
to  tum  to  the  left.  The  quanutauve  control  such  as  how  much  to 
change  the  speed  or  the  dircuon,  though  important,  tends  to  be  more 
gradual  and  is  of  secondary  importance  in  nature. 

The  above  observations  suggest  that  although  both  the 
polanty  and  the  magnitude  of  error  signal  shoud  be  taken  into 
consideration  in  designing  control  strategies,  dearly  there  should  be 
a  difference  in  their  pnonty  ranking.  The  error  polanty  is  more 
imponant  than  the  error  magnitude.  There  seems  to  be  very  little 
compromise  on  error  polanty  in  decision  making.  Nevenheless, 
(here  appears  to  exist  a  great  deal  of  flexibility  for  reducing  the  error 
magnitude.  It  is  also  to  be  noted  that  generations  of  good  comco' 
law  requires  good  observations  and  feedback.  Good  dnving 
requires  good  front,  rear  and  side  views  from  inside  of  the  car. 
Therefore,  to  have  good  control  strategy,  the  controller  can  use  as 
many  sensor  feedbacks  as  possible.  This  calls  for  the  observation 
enrichment.  By  observation  ennehment.  we  mean  to  expand  our 
observation  capability.  It  does  not  necessarily  call  for  the  use  of 
extra  sensors. 

For  example,  what  we  see  in  the  rear  view  mirror  about  the 
car  approaching  from  behind  provides  not  only  the  distance 
information,  but  also  the  speed  information  (which  is  derived  from 
the  rate  the  distance  is  changing)  This  information  can  be  used  by 
the  driver  for  speeding  up  the  car  or  for  changing  lanes.  This  form 
of  indirect  feedback  information  (deriving  both  position  and  speed 
information  from  vision)  does  call  for  enhanced  information 
processing  capability.  Some  intelligent  sensor  capable  of 
information  processing  should  be  a  pan  of  the  intelligent  controller 
What  has  been  discussed  so  far,  such  as  controller  input  ennehment 
(observation  ennehment  of  the  plant),  controller  output  ennehment 
control  ennehment  of  the  plant),  pnonty  ranking  indecision  making 
i  polarity  over  magnitude,  qualitative  over  quantitative  in  the 
interpretation  of  error  signal)  are  simpler  to  address  than  the  more 
difficult  problem  of  addressing  the  issue  of  correcting  the  error 
magnitude  (the  quantitative  aspect!  As  we  mentioned  earlier,  there 
seems  to  be  a  great  deal  of  flexibility  for  reducing  the  error  even  after 
the  error  polanty  has  been  determined.  To  reduce  the  error 
magnitude  we  have  to  work  within  the  constraint  of  the  control  law. 
Too  much  and  too  little  correction  for  the  error  magnitude  is  equally 
unsatisfactory  They  may  cause  instability.  Therefore,  one  basic 
consideration  in  controller  design  is  to  ensure  the  systems  stability 
Even  if  the  control  action  results  in  a  stable  response,  too  much 
correction  will  cause  overshoot  and  too  little  action  will  have  a  very 
sluggish  response.  So  an  intelligent  controller  should  arbitrate 
stability  and  other  performance  conflicts. 

To  attain  optimal  performance  in  an  ever  changing 
environment,  the  controller  ought  to  be  time-varying  and  adaptive. 
Unfortunately,  such  time-varying  controller  can  not  be  designed  with 
conventional  methods  because  system  parameters  model  is  lacking. 
It  appears  that  a  viable  approach  will  be  a  heuristic,  rule-driven 
controller  that  can  be  fine  tuned. 


til.  Some  Thoughts  on  Control  Design  Strategies 

As  was  pointed  out  earlier,  it  is  either  impossible  or 
mpracncal  to  have  a  precise  dynamical  model  of  a  complex  system. 
Therefore  the  control  design  strategies  tor  such  systems  also  calls  for 
changes  from  conventional  approaches  which  rely  heavily  on  a 
purelv  deductive  mathemaucal  Discipline.  Given  below  are  some  ot 
our  thoughts  on  what  should  he  when  we  design  control  systems 

I  Place  less  emphasis  on  the  need  of  characterizing  the  system 
with  a  precise  mathematical  mcxlel  wuh  known  parameters 
Inslead.  attention  should  be  focused  on  a  better  understanding  of 
the  system  input/output  < I/Os  relationship  In  other  words,  ihe 
system  is  tube  characterized  more  explicitly  in  lerms  of  us  cause 
and  effects  relationsmp  More  precisely  speaking,  u  is  more 


important  to  have  a  good  understanding  ot  what  the  inputs  will 
do  to  the  outputs.  So  qualitative  11  e.  physical)  description  ot 
the  system  is  more  important  than  the  quantitative  ue 
mathematical)  description  of  the  system.  Mathemaucal  model  is 
needed  mainly  for  the  basic  operation  of  the  system,  and  hence, 
better  to  have  the  model  as  simple  as  possible.  As  such,  lumped 
I/O  characterization  (e  g.  quaditaove  cause-effect,  relauonshtp  or 
transfer  function  I  will  be  preferred  to  the  state  space 
characterization. 

2.  Single-input  single-output  iSISO)  system  design  should  be 
replaced  by  multiple-input,  mulnple-ourput  i.MIMO)  design  if 
needed  through  control  ennehment  and  observation  ennehment 
of  the  system.  It  is  well  known  chat  an  uncontrollable  or  an 
unobservable  system  can  be  made  to  be  controllable  and 
observable  through  expansion  of  control  capability  or 
observation  capability  The  multiple  controls  should  be  made  to 
be  as  independent  as  possible.  This  control-decoupling  propenv 
will  render  more  transparent  I/O  characterization  and  hence 
facilitate  the  decision  making  of  the  controller.  The  muitiple- 
obervations  of  the  system  should  be  closely  related  to  allow 
accurate  and  reliable  assessment  of  the  system  state  for 
intelligent  decision  making.  In  some  applications,  remote 
control  and  remote  i  non-contact  I  sensing  (as  in  space,  nuclear 
and  robots  applications!  should  also  be  explored.  Indirct 
sensing  ti  e.  multiple  information  obtained  from  signal 
processing  of  a  single  observed  informauon.  e  g.  vision  may 
provide  position  as  well  as  velocity  informauon)  may  provide 
versatile  and  effecuve  observation 

3.  Future  controllers  can  be  an  intelligent  controller.  The 
controller  will  have  data  base  for  storing  commands  ion  task 
requirements)  and  rule-based  algorithms  i either  preprogrammed 
or  accumulated  from  expenences).  It  has  memory  for  storing 
past  information  for  learning  from  expenences.  The  decision 
making  of  the  controller  wdl  be  an  expen  system  which  consists 
of  a  collection  of  basic  decision  makers.  The  control  law 
generated  by  the  conn-oiler  is  a  collective  effort  of  each 
individual  decision  maker  whose  pamcipanon  or  not  is  decided 
intelligently  in  a  time-varying  manner  based  upon  rule-based 
interpretations  of  the  observed  intormatton  and  system 
requirements. 

4  43uied-based  decision  making  will  be  a  combination  of 
qualitative  i logical)  and  quantauve  icompuianonal  i  nature  A 
mixture  of  heuristic  as  well  as  deductive  decision  processes.  In 
a  hterachtcal  decision  process,  the  qualitauve  decision  ti  e 
which  way  the  correction  of  error  has  to  be)  will  have 
precedence  over  that  of  quanutauve  decision  ti  c.  how  much  the 
error  correction  is  to  be  made).  The  quantitative  decision  will 
be  a  time-varying  process  Instead  of  being  a  continuously 
varying  process,  a  more  pracncal  wav  appears  to  be  piecewise- 
constant  nme-varying  process.  In  other  words,  the  parameters 
of  change  are  a  set  of  finue  values  and  iheir  values  will  be 
changed  at  discrete  instant  of  times  decided  by  rule-based 
algorithms. 

3.  Rule-based  algorithms  are  designed  ro  ensure  the  stability  of 
the  system  and  are  used  to  arbitrate  conflicting  performance 
requirements  over  different  intervals  of  overall  operanon  for 
maximizing  the  total  performance. 

6.  An  intelligent  controller  must  be  capable  of  learning  from 
expenences  and  of  opumizing  performances  in  a  given  set  of 
conditions.  Therefore,  an  intelligent  controller  will  be  a 
combined  effort  of  applying  results  from  Artificial  Intelligance 
(Al).  Control  and  Signal  Processing  (CSP)  and  Operational 
Reserach  (OR). 

7  The  advancements  of  computer  technology  are  making 
intelligent  controllers  more  feasible. 

5  Integraied  system  design  concept  should  be  developed.  To 
design  an  intelligent  control  system  considerations  for  plant, 
control  capability,  (actuator  technology)  observation  capability 
(sensor  technology),  and  controller  decision  making  all  need  to 
be  addressed  at  the  outset  Conventional  control  approach  ot 
starting  wuh  a  given  plan!  and  then  working  to  find  a  needed 
compensator  (controller)  ro  ensure  satisfactory  performance 
should  be  discouraged  Instead,  we  should  alwavs  ask  ihe 
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question  of  "how  to  have  the  "best"  control  system  (  best 
system  is  not  meant  to  be  an  optimal  system  in  the  traditional 
'<n se)  for  the  tasks  to  be  accomplished?" 

9.  We  need  better  understanding  on  learning  theory,  time- 
varying  and/or  nonlinear  control  theory. 

Shown  in  figures  1  and  2  are  possible  configurations  of 
control  systems  with  intelligent  controllers. 

IV.  Examples 

Given  below  are  two  examples  on  controller  design  based  on 
combination  of  heuristic  as  well  as  deductive  reasoning.  They  are 
found  to  produce  much  better  performances  than  those  designed  with 
conventional  design  method. 

Figure  3  shows  the  step  response  of  a  double-integrator  that 
Traduces  a  fast  response  without  causing  any  overshoot.  The 
.ontroller  is  a  simple  gain  scheduling  scheme  designed  by  dynamic 
note  placement  argument  (101  It  is  based  on  good  qualitative 
understanding  of  the  effects  of  velocity  loop  (that  affects  the 
lamping)  and  the  position  loop  (that  affects  the  bandwidth)  on  the 
system  response. 

Figure  4  shows  the  improvement  of  convergence  of  a 
learning  controller.  The  controller  incorporates  a  combination  of  two 
separate  learning  algorithms  operating  over  two  different  intervals  of 
operations.  The  learning  algorithm  1  applies  only  to  linear  system 
1 1 1  i  which  diverges  when  applied  to  a  nonlinear  system.  The 
leuring  algorithm  2(12]  applies  to  some  nonlinear  system,  but 
converges  slowly.  By  using  learning  algorithm  1  for  start  up  and 
then  switching  to  algorithm  2.  it  is  found  that  convergence  rate  for 
nonl.near  system  improves  significantly 

More  and  more  intelligent  controllers  using  expen  systems 
have  begun  to  emerge.  Real-time  expen  systems  for  desalinization 
system  and  robot  ping-pong  game  have  been  reponed  recently  by 
Reliable  Water  Inc.  and  AT&T  respectively  Without  any  doubt, 
intelligent  control  will  be  used  more  and  more  in  the  future  for 
control  of  complex  systems. 
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Abstract 

The  case  of  controlling  dynamical  systems  with 
unknown  or  partially  known  dynamics  which  are  required 
to  repeatedly  track  a  desired  trajectory  over  a  finite  tune 
interval  is  of  great  interest  in  practice  One  possible  ap 
proch  to  solve  such  problem  is  the  application  of  adap¬ 
tation  anil  learning  in  automatic  systems  This  paper 
presents  a  method  of  designing  a  learning  controller  based 
,*n  the  representation  ol  the  input  output  signals  by  a  set  of 
orthogonal  functions.  Two  theorems  concerning  the  learn 
ing  strategies  in  the  design  of  the  controller  are  given, 
They  extend  the  results  of  S.  Ariiuoto's  betterment  pro 
cesses  in  the  sense  that  some  less  restrictive  conditions  for 
convergence  are  needed.  Examples  are  *lso  given  to  illui- 
trate  the  results.  Potential  applications  to  robot  manipu¬ 
lator  control  are  also  noted. 


I.  Introduction 

In  practice,  we  often  face  the  problem  of  control 
hug  dynamical  systems  with  incomplete  or  little  apnori 
information  about  the  plant.  The  solution  is  usually  based 
•  >n  i  he  application  of  adaptation  and  learning  in  automatic 
systems  which  reduced  initial  uncertainty  by  using  the  in¬ 
formation  obtained  during  the  process  of  control.  Due  to 
che  availability  of  an  ever-increasing  computational  power, 
many  adaptive  and  learning  algorithms  have  emerged  and 
numerous  succesful  applications  have  been  reported  ll{  and 
l (II  Recently  a  learning  control  algorithm  called  better¬ 
ment  process  was  proposed  by  S.Arimotoet  aj.  1 2 1  to  solve 
he  particular  but  in  practice  very  important  case  of  con¬ 
trolling  sy terns  with  unknown  or  partially  known  dynamics 
which  are  required  to  repeatedly  track  a  desired  trajec¬ 
tory  The  iterative  approch  taken  by  Arimoto  et  al.  is 
based  on  a  simple  algorithm  which  updates  the  input  to 
the  swem  based  on  the  previous  input  and  a  function  of 
the  difference  between  the  previous  output  and  the  desired 
trajectory.  However,  this  method  is  limited  to  a  certain 
rlasses  of  systems  and  is  applicable  under  certain  rather 
restricted  conditions.  In  this  paper  an  alternative  approch 
based  <»n  the  representation  of  input-output  signals  by  a 
set  of  orthogonal  functions  is  presented.  Orthogonal  func- 
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tions  such  as  the  complex  exponentials,  the  Walsh  func¬ 
tions  etc...  are  attractive  pnmarly  because  of  their  poten¬ 
tial  of  signal  characterization  with  definite  advantages  in 
computational  aspects  111 


2.  Main  results 

In  what  follows  we  shall  consider  the  linear  time 
invariant  single  input  single-output  system  defined  by  v 
=  Lf  u  |  where  u,  y  are  the  system  input  and  sytem  output 
respectively  and  L  is  a  linear  operator  We  also  consider 
the  following  iteration  formulas  for  input  in  the  learning 
process: 

u*-mH  )  =  Ufcll  )  -  1  111 


“*♦1(0  =  “*(<)  •*-  *  ~{r4|f  If 


and  the  error 


ffclf )  =  y*(tl  -  y*(  M 


where  ut(<)  and  y*U  |  are  respectively  the  input  and  the 
output  of  the  system  for  the  k  Hi  iteration.  yd[t)  19  s„ine 
given  function  and  -v,  6  are  some  constants  The  problem 
is  then  to  find  conditions  that  ensures 

y*(f>  — •  ydffJ  as  k  — •  to 


in  some  sense,  for  all  t  i.e 


•  ac  for  all  t 


for  some  chosen  norm. 

Definition 

The  function  fit)  ~  defined  in  the  in¬ 

terval  it  i,  til  is  said  to  be  an  <  approumation  of  a  given 
function  f|t  |  on  the  interval  if, ,  with  respect  to  a  chosen 
set  of  orthogonal  functions  described  by  the  vector  vajued 
function: 

+  =  ,  d„|f  l! r 

if  for  a  given  positive  i  there  exiftf  a  number  N  such  that 
for  ail  t  in  jt!, I, | 

1/  -  /'I  ■  ,  for  n  ■  V 

la  this  paper  i|9|l  denotes  the  L.’  norm  „(  a  square 
intrgrabie  function  g|t)  defined  on  the  interval  by 


'  a  •  w  *  .  C, 
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Lemma 

Let  the  real  valued  function  f(t)  be  aquare- 
integrable  on  the  interval  then  the  t  -approximation 

of  f(  1 1  is  given  by 


fit)  =  V*  <*,$,! 


4.  It  has  been  shown  l2j  that  using  the  feedback  lin¬ 
earisation  technique  an  n-joint  robot  manipulator  can 
be  transformed  to  a  set  of  n  decoupled  second  order 
linear  time-invariant  systems,  therfore  the  learning  al¬ 
gorithm  presented  in  this  paper  can  also  be  applied  to 
a  certain  class  of  nonlinear  systems,  such  as  robot  ma¬ 
nipulator  control  systems. 


=  >i/4 1  i  =  l,2 . V  (1.6) 

*>.==/  /in  o* m  dt 

"r>  _ 

k,  =  I  <5 -'(nil 

/  ti 

The  maun  results  o(  the  learning  process  will  be  given  m 
the  following  two  theorems  without  proof.  The  proof  can 
be  found  m  91. 

Theorem  1. 

Let  y^(f)  be  a  given  desired  trajectory  defined 
over  a  time  interval  jf».fjJ  and  let  Cfc(t)  be  as  defined  in 
•  111  and  ei,(t|  be  its  t  approximation  with  respect  to  $ 

A  iso  let 

L|*)  =  P*  1 1.8) 

wnere  P  is  a  .V  *  .V  constant  matrix,  and  L  is  the  linear 
operator  representing  the  system.  Then 

)e*f|  — •  0  as  k  — •  x>  (1.9) 


3.  Examples 

The  illustration  of  the  two  theorems  presented  in  this 
paper  is  given  in  the  following  two  examples  : 

Example  1 

The  applicability  of  the  result  of  theorem  l  is  illus¬ 
trated  iu  this  example.  Consider  the  first  order  linear  time 
invariant  system  described  by  the  following  transfer  func- 


P(sj  =  100 - 

a  -  0.9 

For  easy  checking  with  analytic  result  let  the  desired  out 
put  y*!  i )  be< 

yd(i)  =  3.46410/  -  0.73205 
which  can  also  be  written  as  in  (1.5) 

y«i(f )  =  C|0|(()  +  citfitf ) 


if  and  only  if 


>  |.V  -  UllPUV 


where  ■'  P\ J-  and  irP  are  llie  Forliemous  norm  and  ihe  trace 
of  P  respectively. 

It  is  noted  that  condition  (110)  is  somewhat  re¬ 
strictive  and  may  not  be  satisfied  for  certain  class  of  linear 
systems.  This  condition  can  be  relaxed  by  the  following 
theorem. 

Theorem  2. 

For  the  problem  considered  in  theorem  l  and  a 
given  desired  trajectory  yal  f )  defined  over  the  time  interval 
1 1 , f?|,  and  for  a  given  set  of  orthogonal  functions  defined 
by  the  vector  valued  function  ♦  if  there  exists  a  constant 
V  .  V  matrix  Q  which  satisfies  the  following  condition 

,|/-t?PrV<l  ii.U) 


®i(<)  =  i 

®,(t)  =  2 v  31  -  ^3 

are  two  Legendre  functions  In  tins  caae  N  =  2  and 


99.62  -  19 

.19  99  94 


the  condition  11.10)  is  satisfied  and  the  algorithm  till 
converges  in  2  iterations  as  it  is  shown  in  figure  L 

Example  2 

Consider  the  following  system. 

z  =  At  +  flu 


ileoli  — •  0  a s  k  — •  oo 

f  omments 

1.  The  generalization  of  theorem  1  and  theorem  2  to  the 
case  of  tnuiti-dimensionaJ  linear  systems  is  straight 
forward. 

2.  The  computational  complexity  of  the  learning  algo¬ 
rithm  may  depends  on  the  types  of  the  orthogonal 
functions  chosen  in  the  (-approximation. 

If  P  is  nonsingular  then  the  best  choice  for  Q  is 


and  the  desired  output  is 


C  -  l  0 


'  •  AV.V  • 


LVV.'V 


fvTS 


ITT  u-1 

-  !  'j6.G3 

190  SO' 

2S0.0 

-2T2.10 

102.15 

2T0.6G 

-104  80 

111  0 

209  10 

-  108  bO 

230.50  j 

tor  f  in  0,  1..  Tor  this  system  the  condition  (1  10)  of  the¬ 
orem  1  i s  not  satisned.  however  by  choosing  a  set  of  4 
Legendre  functions  with  a  matrix  Q 


'  -100.58 
-176.70 


for  which  condition  ll.ll)  is  satisned.  y nO  converges  to 
M  in  4  iterations  as  it  is  shown  m  figure  2.  We  notice  also 
■.hat  since  in  this  example  CB  =  0.  the  conditions  for  the 
convergence  of  the  {'‘-type  betterment  as  has  been  shown 
m  31,  ,4i  are  not  satisfied.  Figure  i  shows  the  divergence 
of  yfc(M  from  the  desired  trajectory  y^t)  as  the  number 
of  iteration  k  gets  larger  However,  for  a  system  with  the 
same  matrices  .4  and  B  as  the  system  considered  in  this 
example  but  with  matrix  C  equal  to 


he  i.'1  type  betterment  converges  as  it  is  shown  in  figure 
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